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GBS TOXIN RECEPTOR 



PCT/US99/I6676 



INTRQPUCTI Q N 

Technical Fi^ld 

5 This invention provides compositions and methods relating to GBS toxin 

receptor polynucleotides and polypeptides. The invention relates to a receptor for a 
polysaccharide isolated from a bacterial source. 

Background 

Group B 6-hemolytic Streptococci (GBS) are ubiquitous microorganisms. 
10 GBS is not known to cause any harmful infections in humans except for very young 
babies. GBS pneumonia, also called "early-onset disease", is associated with high 
morbidity and mortahty in newborn infants. 

In a series of studies conducted by Dr. Carl G. Hellerqvist and his associates at 
the Vanderbilt University School of Medicine, Nashville, Tennessee, a polysaccharide 
1 5 GBS toxin was identified. This toxin was determined to be a major factor in the 

complications of GBS pneumonia, and was found to be useful as a therapeutic agent 
in combating tumors though inhibition of vascularization (U.S. Patent No. 5,010,062). 

In addition, as described in U.S. Patent No. 5,858,991 and W098/32453, GBS 
toxin facilitates wound healing in patients by minimizing scarring and accelerating 
20 healing, and reduces wound-related tumor progression. 

W098/32452 and W098/32448 describe the use of GBS toxin as a therapeutic 
agent for treating patients with chronic inflammatory diseases, such as rheumatoid 
arthritis and psoriasis, and for enhancing repair of neural injury. 

Prior to this invention, receptors for GBS toxin had not been identified. The 
25 inventors, believing receptors of GBS toxin to reside on cells in the developing 
vasculature of tissues undergoing angiogenesis in the conditions described above, 
embarked upon a series of experiments resulting in the present invention. 



1. 
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SUMMARY OF THF ^^A/F^ITfn^( 
For the first time, novel receptors for group B p-hemolytic Streptococcus GBS 
toxin (GBS toxin receptor) have been identified. One aspect of the invention provides 
a polypeptide comprising a GBS toxin receptor or polypeptide fragment thereof 
5 Preferred embodiments include mammalian GBS toxin receptors. Also provided is an 
antibody that recognizes GBS toxin receptor or a fragment thereof The polypeptide 
of the invention can be used, inter alia, for the screening of compounds that can be 
used to treat or prevent conditions arising from pathologic or hypoxia-driven 
angiogenesis or neovascularization, such as, for example, cancerous tumors, chronic 
10 inflammatory disease, scarring during wound healing, keloids, neural injury, and 
reperfiision injury. 

Another aspect of the invention provides a polynucleotide encoding a GBS 
toxin receptor or a fragment thereof and a polynucleotide hybridizable to such 
polynucleotide. Preferred polynucleotides are at least 10 bases in length and comprise 

1 5 a nucleic acid sequence encoding, or are complementary to a nucleic acid sequence 
encoding, a mammalian GBS toxin receptor or a polypeptide fragment thereof 

A third aspect of the invention is a complex comprising a GBS toxin bound to 
a mammalian toxin receptor or fragment thereof Also provided is a method of 
forming such complex. The method comprises contacting a GBS toxin with a 

20 polypeptide comprising a mammalian GBS toxin receptor, or fragment thereof that 
can bind GBS toxin, under conditions that permit specific binding of the GBS toxin to 
the polypeptide, and allowing the complex to form. 

Yet another aspect of the invention is a method for purifying a compound that 
binds a GBS toxin receptor. The method comprises providing a polypeptide 

25 comprising a mammalian GBS toxin receptor, or fragment thereof that binds GBS 
toxin, contacting the polypeptide with a sample comprising the compound under 
conditions that allow specific binding of the compound to the polypeptide, and 
separating the bound compound from the remainder of the sample. 

Another aspect of the invention is a method of determining the presence or 

30 absence of GBS toxin in a sample. The method comprises contacting the sample with 
a polypeptide comprising a mammalian GBS toxin receptor, or fragment thereof that 
binds GBS toxin, imder conditions that allow specific binding of GBS toxin to the 
GBS toxin receptor, and determining whether specific binding of GBS toxin has 

2. 
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occurred. Presence of GBS toxin in a sample obtained from a neonate is indicative of 
early onset disease. 

A sixth aspect of the invention is a method for detecting pathologic 
vasculature in a mammalian tissue. The method comprises detecting the presence of a 
5 GBS toxin receptor. The method can be used for detecting or monitoring a variety of 
medical conditions associated with angiogenesis or neovascularization, such as, for 
example, detecting metastasis of a cancerous tumor, or monitoring the margin of a 
tumor in a mammal undergoing a therapy for cancer. 

Another aspect of the invention provides methods for the identification of drug 

1 0 candidates for the treatment of medical conditions characterized by pathologic and/or 
hypoxia-driven angiogenesis or neovascularization. One embodiment is a method for 
identifying a compound that specifically binds a mammalian GBS toxin receptor The 
method comprises combining a test compound with a mammalian GBS toxin receptor, 
or fragment thereof that can bind GBS toxin, under conditions that allow specific 

1 5 binding to occur, and detecting a complex formed between the test compound and the 
polypeptide. Another embodiment is a method for determining cytotoxicity of a test 
chimeric compound. The method comprises exposing a cell expressing a mammalian 
GBS toxin receptor, or fragment thereof that binds GBS toxin, to a test chimeric 
compound comprising a cytotoxic agent coupled to GBS toxin, and detecting signs of 

20 toxicity. Yet another embodiment is a method for identifying an inhibitor of a GBS 
toxin receptor by incubating test cells that express GBS toxin receptor, or a fragment 
thereof, in the presence and absence of a test compound and under conditions in 
which the cells incubated in the absence of the test compound can proliferate or 
migrate, and comparing the proliferation or migration of the test cells incubated in the 

25 presence and absence of the test compound, wherein less proliferation or migration in 
the presence of the test compound is indicative of the test compound being an 
inhibitor of the GBS toxin receptor. An inhibitor of endothelial cell proliferation or 
migration can be identified by the above method, wherein less proliferation or 
migration of test cells in the presence of the test compound is indicative of the test 

30 compound being an inhibitor of endothelial cell proliferation or migration. A 
therapeutic compound for the treatment or prevention of a medical condition 
characterized by pathologic angiogenesis or neovascularization can also be identified 
by the above method, wherein less proliferation or migration of test cells in the 
presence of the test compound is indicative of the test compound being a candidate 

3. 
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therapeutic compound for the treatment or prevention of the medical condition. 

The invention also provides a method for identifying a compound which 
inhibits binding of a GBS toxin to a mammahan GBS toxin receptor. The method 
comprises simulating and selecting the most probable conformations of a mammalian 
5 GBS toxin receptor, designing a chemically modified analog that substantially mimics 
the energetically most probable three-dimensional structure of the polypeptide, 
chemically synthesizing the analog, and evaluating the bioactivity of the analog. Also 
provided is a method for identifying a compound which binds to a mammalian GBS 
toxin receptor. The method comprises simulating and selecting the most probable 
1 0 conformations of a mammalian GBS toxin receptor, deducing the most probable 
binding domains of the polypeptide, designing a compound that would form the 
energetically most probable complexes with the polypeptide, chemically synthesizing 
the compound, and evaluating the bioactivity of the compound. 

Another aspect of the invention is a method for the prevention or treatment of 
15 neonatal onset disease in a human neonate by administering an inhibitor of binding of 
GBS toxin to a human GBS toxin receptor. 

Yet another aspect of the invention is a method for inhibiting pathologic or 
hypoxia-driven endothelial cell proliferation or migration in a mammalian tissue. The 
method comprises specifically binding a molecule to a GBS toxin receptor present on 
20 the surface of at least one cell in the tissue, the molecule being selected from the 
group consisting of a compound that can evoke an inflammatory response when 
bound to a GBS toxin receptor in a mammal, a chimeric compound comprising a 
cytotoxic compound coupled to a compound that specifically binds the GBS toxin 
receptor, an inhibitor of GBS toxin receptor phosphorylation, and an inhibitor of GBS 
25 toxin receptor activity. 

The invention also provides a GBS toxin receptor or fragment thereof, an 
inhibitor of a GBS toxin receptor, or an inhibitor of binding of a GBS toxin to a GBS 
toxin receptor, for use in a method of treatment of the human or animal body or for 
the manufacture of a medicament for the treatment of a medical condition 
30 characterized by pathologic or hypoxia-driven angiogenesis or neovascularization. 
Also provided is a chimeric compound comprising a cytotoxic agent coupled to a 
compound that binds GBS toxin receptor for use in a method of treatment of the 
human or animal body. 

Also provided are pharmaceutical compositions comprising an inhibitor of a 

4. 
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GBS toxin receptor and/or a chimeric compound comprising a cytotoxic agent 
coupled to a compound that binds GBS toxin receptor, and a pharmaceuiically 
acceptable carrier. 

The invention also provides kits comprising a GBS toxin receptor or fragment 
5 and/or reagents for detecting the presence of a GBS toxin receptor or polypeptide 
fragment thereof or the presence of a polynucleotide encoding same. 

BRIEF DESCRTPTTON OF THE PR AWTNHS; 
FIG. 1 depicts a process of rational drug design. 

10 FIGS. 2 A and 2B depict the results of inmiunohistochemical analysis of GBS 

toxin receptor expression in cancerous and normal human ovary tissue, respectively, 
using antibody Pab55 as described in Example 4. 

FIGS. 3A and 3B depict the results of immunohistochemical analysis of GBS 
toxin receptor expression in cancerous and normal himian ovary tissue, respectively, 
15 using antibody Pab57 as described in Example 4. 

FIGS. 4A-4C depict the targeted delivery of a chimeric compound to GBS 
toxin receptor expressed in a cancerous tissue as described in Example 6. 

DESCRIPTION OF SPECTFTr FM BODTMFNT?; 

20 Definitions 

Generally, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to, which this 
invention belongs. The nomenclature used herein and the laboratory procedures in 
cell culture, molecular genetics, and nucleic acid chemistry and hybridization 

25 described below are those well known and commonly employed in the art. Standard 
techniques are used for recombinant nucleic acid methods, polynucleotide synthesis, 
and microbial culture and transformation (e.g., electroporation, lipofection). 
Enzymatic reactions and purification steps supplied by manufacturers are typically 
performed according to the manufacturer's specifications. The techniques and 

30 procedures are generally performed according to conventional methods in the art and 
various general references (See generally, Sambrook et aL, Molecular Cloning: A 
Laboratory Manual, 2d ed. (1989) Cold Spring Harbor Laboratory Press, Cold Spring 

5. 
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Harbor, N.Y.) which are provided throughout this document. The nomenclature used 
herein and the laboratory procedures in analytical chemistry, organic synthetic 
chemistry, and pharmaceutical formulation described below are those well known and 
commonly employed in the art. Standard techniques can be used for chemical 
5 syntheses, chemical analyses, pharmaceutical formulation and delivery, and treatment 
of patients. As employed throughout the disclosure, the following terms, unless 
otherwise indicated, shall be understood to have the following meanings: 

By "GBS toxin receptor" is meant a proteinaceous molecule capable of 
binding a toxin from Group B P-hemolytic Streptococcus bacteria (GBS toxin), such 

10 as, for example, CMlOl . A GBS toxin receptor is usually found in nature on the 

surface of a cell. Recombinant membrane bound and soluble GBS toxin receptors can 
be produced by laboratory techniques known in the art and described herein. 

The term "isolated polynucleotide'* referred to herein means a polynucleotide 
that has been subjected to manipulation, such that the isolated polynucleotide is no 

15 longer associated with the chromosome or cell that the polynucleotide is normally 
associated with in nature in the same manner as it is normally associated in nature. 
An example of an "isolated polynucleotide" is a polynucleotide of genomic, 
recombinant, or synthetic origin or some combination thereof. 

The term "isolated protein" referred to herein means a protein that is no longer 

20 associated with the cell that the protein is normally associated with in nature in the 
same manner as it is normally associated in nature, such as (1) a protein free of at 
least some other proteins from the same source, (2) a protein expressed by a cell from 
a different species, (3) a protein that does not occur in nature, and (4) a protein 
produced from cDNA, recombinant RNA, or synthetic origin or some combination 

25 thereof. 

The tenn "polypeptide" is used herein as a generic term to refer to native 
protein, fragments, or analogs of a polypeptide sequence. Hence, native protein, 
fragments, and analogs are species of the polypeptide genus. 

The term "naturally occurring" means found in nature. For example, a 
30 polypeptide or polynucleotide sequence that is present in an organism (including 

viruses) found in nature and which has not been intentionally modified by man in the 
laboratory is naturally-occurring. 

The term "operably linked" refers to a juxtaposition wherein the components 
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SO described are in a relationship permitting them to function in their intended 
manner. A control sequence ''operably linked" to a coding sequence is ligated in such 
a way that expression of the coding sequence is achieved under conditions compatible 
with the control sequences. 
5 The term "control sequence" refers to polynucleotide sequences which are 

necessary to effect the expression of coding sequences to which they are ligated. The 
nature of such control sequences differs depending upon the host organism; in 
prokaryotes, such control sequences generally include promoter, ribosomal binding 
site, and transcription termination sequence; in eukaryotes, generally, such control 
10 sequences include promoters and transcription termination sequence. The term 
"control sequences" is intended to include, at a minimum, all components whose 
presence is necessary for expression, and can also include additional components 
whose presence is advantageous, for example, leader sequences and fusion partner 
sequences. 

15 The term "polynucleotide" as referred to herein means a polymeric form of 

nucleotides of at least 10 bases in length, either ribonucleotides or 
deoxyribonucleotides or a modified form of either type of nucleotide. The term 
includes single- and double-stranded forms of DNA. 

The term "oligonucleotide" referred to herein includes naturally occurring, and 

20 modified nucleotides linked together by naturally occurring and non-naturally 

occurring oligonucleotide linkages. An oligonucleotide is usually a polynucleotide 
200 bases or fewer in length. Preferably oligonucleotides are minimally 10 to 60 
bases in length and most preferably 

15-35 bases in minimal length; Oligonucleotides are usually single-stranded, e.g. for 
25 probes; although oligonucleotides may be double-stranded, e.g. for use in the 

construction of a gene mutant. Oligonucleotides of the invention can be either sense 
or antisense oligonucleotides. The term "naturally occurring nucleotides" referred to 
herein includes deoxyribonucleotides and ribonucleotides. The term "modified 
nucleotides" referred to herein includes nucleotides with modified or substituted sugar 
30 groups and the like. The term "oligonucleotide linkages" referred to herein includes 
oligonucleotides linkages such as phosphorothioate, phosphorodithioate, 
phosphoroselenoate, phosphorodiselenoate, phosphoroanilothioate, phoshoraniladate, 
phosphoroamidate, and the like. An oligonucleotide can include a label for detection, 
if desired. 

7. 
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By "complementary" or "complement" is meant that wherever adenine 
appears in a first nucleic acid sequence, thymine or uracil is found in the 
"complementary" sequence and vice versa, and wherever guanine appears in a first 
nucleic acid sequence, cytosine is found in the "complementary" sequence and vice 
versa. 

The term "sequence identity" describes the proportion of base matches 
between two nucleic acid sequences or the proportion of amino acid matches between 
two amino acid sequences, i.e. the degree of identity between two sequences. When 
sequence identity is expressed as a percentage, e.g., 50%, the percentage denotes the 
proportion of exact matches over the length of sequence from a GBS toxin receptor 
sequence that is compared to some other sequence. Various computer aligmnent 
programs can be used to determine sequence identity. In its simplest form, % identity 
is calculated by dividing the number of exact matches between two nucleic acid 
sequences or between two amino acid sequences by the total number of nucleotides or 
1 5 amino acids in the reference sequence. For example, if there are 300 matches 
between sequences 400 amino acids in length, the sequences have 75% identity. 
Uracil and thymine are considered identical when comparing a ribonucleic acid 
sequence with a deoxyribonucleic acid sequence. 

As applied to polynucleotides, the term "substantial identity" means that two 
20 nucleic acid sequences when optimally aligned, such as by the program BLAST 
(Altschul et al.,y. MoL Biol 215:403-410 (1990)), share at least about 85%, 
preferably at least about 90% sequence identity and most preferably 95% or greater 
sequence identity. Whenusingcomputer alignment programs, gaps (in either of the 
two sequences) are permitted to maximize matching; gap lengths of 15 bases or less 
25 are usually used; 6 bases or less are preferred; 2 bases or less are most preferred. 
When using oligonucleotides as probes or in treatments, the sequence identity 
between the target nucleic acid and the oligonucleotide sequence is generally not less 
than 17 target base matches out of 20 possible oligonucleotide base pair matches 
(85%); preferably not less than 9 matches out of 10 possible base pair matches (90%), 
30 and most preferably not less than 1 9 matches out of 20 possible base pair matches 
(95%). 



8. 
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Preferably, bases which are not identical nevertheless are part of a degenerate 
codon that encodes the same amino acid at that amino acid position. Alternatively, 
bases which are not identical preferably are part of a degenerate codon that encodes a 
conservative amino acid substitution for that amino acid position. 
5 As applied to polypeptides, the term "substantial identity" means that two 

peptide sequences, when optimally aligned by the BLAST computer program, share at 
least about 80 percent sequence identity, preferably at least about 86 percent sequence 
identity, more preferably at least about 95 percent sequence identity, even more 
preferably at least about 99 percent sequence identity up to having one amino acid 

10 difference, and most preferably share 100% identity. Gaps (in either of the two 

sequences being matched) are allowed in maximizing matching; gap lengths of 5 or 
less are prefenred with 2 or less being more preferred. Preferably, residue positions 
which are not identical differ by conservative amino acid substitutions. Conservative 
amino acid substitutions refer to the interchangeabihty of residues having similar side 

15 chains. For example, a group of amino acids having aliphatic side chains is glycine, 
alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic- 
hydroxyl side chains is serine and threonine; a group of amino acids having amide- 
containing side chains is asparagine and glutamine; a group of amino acids having 
aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino 

20 acids having basic side chains is lysine, arginine, and histidine; and a group of amino 
acids having sulfur-containing side chains is cysteine and methionine. Preferred 
conservative amino acid substitution groups are: valine-leucine-isoleucine, 
phenylalanine-tyrosine, lysine-arginine, alanine- valine, glutamic-aspartic, and 
asparagine-glutamine. " ' ^ - 

25 The term "hybridizable under high stringency conditions" refeired to herein 

means capable of specific binding under conditions whereby only nucleic acid 
sequences having a substantial identity of greater than 95% with respect to each other 
will hybridize. These conditions are known in the art and discussed herein. 

The term "degenerate codon" means any of the nucleotide codon triplets 

30 encoding a desired amino acid according to the genetic code. Codons can be selected 
based upon known preferred codon usage in a host organism such as E, colL 

The term "polypeptide fragment" as used herein refers to a polypeptide that 
has an amino-terminal and/or carboxy-terminal deletion, but where the remaining 
amino acid sequence is identical to the corresponding positions in the naturally- 
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occurring sequence deduced, for example, from a full-length DNA sequence. 
Fragments typically are at least 3 amino acids long, preferably are 5-10 amino acids 
long, more preferably are 10-50 amino acids long, even more preferably are more 
than 50 amino acids long and comprise at least one extracellular domain of a GBS 
5 toxin receptor. Most preferred are fragments that comprise the entire extracellular 
domains of a GBS toxin receptor, and preferably also comprise portions of 
transmembrane and intracellular domains sufficient to maintain the polypeptide 
fragment in a functional stereochemical conformation on the surface of a cell, lipid 
membrane, liposome, micelle, or other lipophilic structure. 
10 The term "immunologically reactive" means having antigenic properties or 

being capable of being specifically bound by an antibody that can specifically bind 
GBS toxin receptor. A substance has antigenic properties if it can generate 
monoclonal or polyclonal antibodies when administered to an animal under conditions 
known in the art to facilitate the production of antibodies that will recognize and bind 
1 5 a particular antigen, 

A "heterologous polypeptide" is a polypeptide different from polypeptides 
normally produced by a particular cell. For example, a GBS toxin receptor 
polypeptide or fragment thereof that is produced recombinantly in a cell that does not 
normally produce such GBS toxin receptor polypeptide or fragment thereof, is a 
20 heterologous polypeptide. A second polypeptide joined to a GBS toxin receptor 

polypeptide or fragment thereof is also a heterologous polypeptide if it is not joined to 
a GBS toxin receptor polypeptide in nature. 

As used herein, the terms "label" or "labeled" refers to incorporation of a 
detectable marker, e.g., by incorporation of a radiolabeled amino acid or attachment to 
25 a polypeptide of biotinyl moieties that can be detected by marked avidin (e.g., 

streptavidin containing a fluorescent marker or enzymatic activity that can be detected 
by optical or colorimetric methods). Various methods of labeling polypeptides and 
glycoproteins are known in the art and may be used. Examples of labels for 
polypeptides include, but are not limited to, the following: radioisotopes (e.g., ^H, ^"^0, 
30 ^^S, fluorescent labels (e.g., FITC, rhodamine, lanthanide phosphors), 

enzymatic labels (e.g., horseradish peroxidase, fi-galactosidase, luciferase, alkaline 
phosphatase), chemiluminescent, biotinyl groups, predetermined polypeptide epitopes 
recognized by a secondary reporter (e.g., leucine zipper pair sequences, binding sites 
for secondary antibodies, metal binding domains, epitope tags). In some 

10. 
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embodiments, labels are attached by spacer arms of various lengths to reduce 
potential steric hindrance. 

The term ^'compound'' as used herein preferably refers to a peptidic, 
peptidomimetic, organic, or other chemical molecule and also refers to a nucleic acid 
molecule or chemical derivative thereof The compound can interact with, or be, the 
polynucleotides or polypeptides of the invention. 

The singular forms "a", "an" and "the" include plural referents unless the 
context clearly dictates otherwise. 

The SEQ ID NOs of the nucleic acid and amino acid sequences described 
herein are summarized below in Table 1. 



Table 1 

Nucleic Acid and Amino Acid Sequences 



SEQ ID NO: 


Type of Sequence 


Description 


SEQ ID NO: 1 


nucleic acid 


Partial human GBS toxin receptor (HP55) 


SEQ ID NO: 2 


amino acid 


Partial human GBS toxin receptor (HP55) 


SEQ ID NO: 3 


nucleic acid 


Sheep GBS toxin receptor (SP55) 


SEQ ID NO: 4 


amino acid 


Sheep GBS toxin receptor (SP55) 


SEQ ID NO: 5 


nucleic acid 


Primer 


SEQ ID NO: 6 


nucleic acid 


Primer 


SEQ ID NO: 7 


nucleic acid 


Full-length human GBS toxin receptor (HP59) 


SEQ ID NO: 8 


amino acid 


Full-length human GBS toxin receptor (HP59) 


SEQ ID NO: 9 


nucleic acid 


Human/Sheep consensus GBS toxin receptor 
coding region 

(with base codes a, c, g, t, m, r, w, s, y, k) 


SEQ ID NO: 10 


amino acid 


Human/Sheep consensus GBS toxin receptor 
coding region (translation of SEQ ID No: 9) 


SEQ ID NO: 11 


nucleic acid 


Human/Sheep consensus GBS toxin receptor 
coding region - . . _ 
(with base codes a, c, g, t, n) 


SEQ ID NO: 12 


amino acid 


Human/sheep consensus GBS toxin receptor 
coding region (translation of SEQ ID NO* 1 1 ) 



The headings provided herein describe the general topic discussed and are not 
1 5 intended to be exclusive of information discussed in other sections. Frequently, 
infoimation, methods, compositions, and other aspects may be applicable to more 
than one embodiment of the invention and can be so combined. 



Introduction 

GBS toxin binds to tissues undergoing pathologic, hypoxia-driven, and 

11. 
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embryologic angiogenesis or neovascularization. The inventors have identified at 
least two mammalian GBS toxin receptors, which are described herein. Examples 1 
and 2 describe the cloning and characterization of some GBS toxin receptors. The 
inventors have classified GBS toxin receptor as an integral protein with seven 
5 transmembrane domains. The predicted segments are shown in Table 7. The protein 
has several putative sites for phosphorylation by cAMP-dependent kinase, protein 
kinase C (PKC), and casein kinase II (CK2). Typically, such integral proteins, upon 
binding of a molecule (e.g., a ligand or an extracellular messenger), undergo a 
conformational change which facilitates phosphorylation at phosphorylation sites such 

10 as those discussed above. The phosphorylation of the protein at these sites may 
trigger a signal transduction cascade, which often results in proliferation or other 
nuclear responses of the cells which have been exposed to the binding molecule. 
Angiogenesis or neovascularization involves proliferation and migration of 
endothelial cells. As discussed in greater detail in Examples 4 and 5, GBS toxin 

15 receptor expression is correlated with medical conditions involving pathologic, 
hypoxia-driven, and embryogenic angiogenesis or neovascularization. GBS toxin 
receptor polypeptides can be used for a variety of purposes, including screening for 
compounds that can inhibit endothelial cell proliferation and/or migration mediated by 
GBS toxin receptor and screening for cytotoxic chimeric compounds that can bind to 

20 and destroy cells expressing GBS toxin receptor. GBS toxin receptor polynucleotides 
can be used for a variety of purposes, including the design of antisense 
polynucleotides that can block translation of messenger RNA encoding GBS toxin 
receptor. 

25 Polynucleotides 

One aspect of the invention provides for isolated polynucleotides at least ten 
bases in length encoding or complementary to a nucleic acid sequence encoding a 
GBS toxin receptor or a fragment derived therefrom. Preferably, the GBS toxin 
receptor is a mammahan GBS toxin receptor, more preferably an ovine, bovine or 

30 feline GBS toxin receptor, and most preferably a human GBS toxin receptor. The 
isolated polynucleotides can be naturally occurring or non-naturally occurring. The 
isolated polynucleotides can comprise a DNA sequence or an RNA sequence in which 
every T is replaced with U. For purposes of determimng percentage identity, T is 
considered equivalent to U. Preferably, the polynucleotides include alleles of an 

12. 
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ovine, bovine, feline or human GBS toxin receptor, and can include alleles of GBS 
toxin receptor of other mammals. These polynucleotides can be isolated using 
polynucleotides derived from SEQ ID NOs: 1. 3, 7, 9 and 1 1, as described further 
below. 

5 Polynucleotides, oligonucleotides and fragments of the invention selectively 

hybridize to nucleic acid strands under hybridization and wash conditions that 
minimize appreciable amounts of detectable binding to nonspecific nucleic acids. The 
polynucleotides can be hybridizable under high stringency conditions to a nucleic acid 
molecule having a nucleic acid sequence comprising at least 20 contiguous 

1 0 polynucleotides, preferably at least 30 contiguous nucleotides of SEQ ID NO: 1 or 
SEQ ID NO: 3, and even more prefereably to the nucleic acid sequence of SEQ ID 
NO: 1, 3, 7, 9 or 1 1 or the complement of SEQ ID NO: 1, 3, 7, 9 or 11 . Such 
polynucleotides can be used for performing selective, high stringency hybridization 
and are particulariy useful for performing amplification of nucleic acid by polymerase 

1 5 chain reaction (PGR) to determine the presence or absence of GBS toxin receptor in a 
sample, for isolating a naturally occurring nucleic acid encoding a GBS toxin receptor 
(see Example 3), as antisense molecules for blocking translation of GBS toxin 
receptor mRNA. Particularly preferred are polynucleotides hybridizable under high 
stringency conditions to a nucleic acid molecule having a nucleic acid sequence 

20 comprising the nucleic acid sequence of nucleotides 266 to 1870 of SEQ ID NO: 7 
(the putative fiill length coding region of a human GBS toxin receptor, excluding the 
start codon), nucleotides 266 to 1870 of SEQ ID N0;7 (the putative full length coding 
region of a human GBS toxin receptor, including the start codon), nucleotides 61 to 
1 542 of SEQ ID NO: 1 (the partial coding region of a human GBS toxin receptor,- 

25 excluding the start codon), nucleotides 58 to 1542 of SEQ ID NO: 1 (the partial 

coding region of a human GBS toxin receptor, including the start codon), nucleotides 
87 to 1568 of SEQ ID NO: 3 (the coding region of a sheep GBS toxin receptor, 
excluding the start codon), nucleotides 84 to 1568 of SEQ ID N0:3 (the coding region 
of a sheep GBS toxin receptor, including the start codon), or a complementary nucleic 

30 acid sequence thereof 

The polynucleotides can have an identity to the nucleic acid sequence of a 
corresponding region of SEQ ID NO: 1 , 3 or 7 or the complement of a corresponding 
region of SEQ ID NO: 1, 3 or 7 in the range of about 85% to 100%, preferably greater 
than about 87% identity, more preferably greater than about 95% identity, and most 
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preferably about 99% to 100% identity. Particularly preferred are polynucleotides 
comprising the nucleic acid sequence of nucleotides 266 to 1 870 of SEQ ID NO: 7, or 
nucleotides 87 to 1568 of SEQ ID NO: 3, SEQ ID NO: 9, SEQ ID NO: 11, or a 
complementary nucleic acid sequence thereof. 
5 Preferably, the polynucleotides comprise a nucleic acid sequence encoding, or 

complementary to a nucleic acid sequence encoding, a polypeptide having an identity 
to the amino acid sequence of a fragment of a GBS toxin receptor in the range of 
about 85% to 100%, more preferably greater than 86% identity, even more preferably 
greater than 95% identity, and most preferably 99% to 100% identity. Preferably, the 
10 fragment binds GBS toxin. Preferred fragments comprise all or a portion of residues 
1 to 495 of SEQ ID NO: 2 or all or a portion of residues 1 to 536 of SEQ ID NO: 8, 
Particularly preferred are polynucleotides comprising a nucleic acid sequence 
encoding a polypeptide having 100% identity to the amino acid sequence of residues 
1 to 495 of SEQ ID NO: 4, residues 1 to 495 of SEQ ID NO: 2, or residues 1 to 536 of 
15 SEQIDN0:8. 

Polynucleotides encoding naturally occurring GBS toxin receptor can be 
isolated from various tissue sources and cell cultures from different species that 
produce such a receptor by the methods described herein, such as, for example, cells 
from tumor endothelium, synovial tissue in rheumatoid arthritis, or hypoxic tissue 
20 deprived of or restricted from blood flow, such as in reperfusion injury or wounded 
tissue. Such polynucleotides can be isolated by hybridization using probes or by 
polymerase chain reaction using oligonucleotides, as well as by implementing other 
molecular biology techniques known in the art. Such probes and oligonucleotides 
typically comprise various regions of the sequence of SEQ ID NO: 1, 3, 7, 9 or 11, 
25 preferably of SEQ ID NO: 1, 3, or 7, or encode various regions of the sequence of 
SEQ ID NO. 2, 4, 8,10 or 12, preferably of SEQ NO: 2, 4 or 8. 

Polynucleotides useful for cloning genes encoding GBS toxin receptors of 
various organisms can be determined by comparing the amino acid sequences of 
homologous proteins, (see Table 4). For example, conserved regions can be targeted 
30 for the synthesis of oligonucleotides or degenerate oligonucleotides to be used as 
probes for hybridization or nucleic acid amplification, techniques discussed further 
below and in Example 3. Stringency can be varied to achieve selective hybridization 
conditions whereby nucleic acid sequences having less than 95% identity with respect 
to each other will hybridize. These conditions are known in the art and discussed 

14. 
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herein and examples are provided. Generally, the nucleic acid sequence identity 
between the polynucleotides, oligonucleotides, and fragments of the invention and a 
nucleic acid sequence of interest will be at least about 85%, and more typically with 
preferably increasing identities of at least about 90%, 95%, 99%, and 100%. 
5 Polynucleotides can be used as probes under high stringency wash conditions 

and with corresponding hybridization conditions, as known in the art. Small 
polynucleotides, for example, polynucleotides 200 bases or fewer in length, are often 
referred to in the art as oligonucleotides. Techniques for using polynucleotides as 
probes to detect the same or related nucleic acid sequences is well known in the art. 
1 0 See, for example, Sambrook et al, especially Chapter 1 1 , the text of which is herein 
incorporated by reference. Usually, probes can be made from polynucleotides that are 
10 to 200 bases in length. Preferably probes are made from polynucleotides 10 to 60 
nucleotides in length and most preferably 12 to 40 bases in length. Specific probes 
can be designed based on results obtained using nucleic acid homology computer 
1 5 programs such as FASTA, which uses the method of Pearson and Lipman (Proc. Natl. 
Acad. ScL USA 85:2444-2448 (1988)) and shows the degree of identity between 
compared sequences. The size of the probe is dependent upon the region of the gene 
to which it will be hybridized. The size of the probe increases as the degree of 
homology to undesirable nucleic acid sequences increases. A probe 10-50 nucleotides 
20 in length can be used, preferably more than 50 nucleotides, even more preferably 
more than 100 nucleotides, and most preferably a probe made from the entire coding 
region of a GBS toxin receptor will be used. To decrease the number of false 
positives, preferably two probes are used to identify clones that bind to both probes 
under hybridization and wash conditions. Oligonucleotides can be synthesized on an 
25 Applied BioSystems oligonucleotide synthesizer according to specifications provided 
by the manufacturer. 

Typically, hybridization and washing conditions are performed at according to 
conventional hybridization procedures. Typical hybridization conditions for 
screening plaque lifts (Benton and Davis (1978) Science 196: 1 80) can be: 50% 
30 formamide, 5 x SSC (sodium chloride, sodium citrate) or SSPE (sodium chloride, 
sodium phosphate, EDTA), 1-5 x Denhardt's solution, 0.1-1% SDS, 100-200 /jg^ 
sheared heterologous DNA or tRNA, 0-10% dextran sulfate, 1 x 10^ to 1 x 10^ 
cpm/ml of denatured probe with a specific activity of about 1x10^ cpm///g, and 
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incubation at 42^C for about 6-36 hours. Prehybridizaiion conditions are essentially 
identical except that probe is not included and incubation time is typically reduced. 
Washing conditions are typically 1-3 x SSC, 0.1-1% SDS, 42-70°C with change of 
wash solution at about 5-30 minutes. Cognate bacterial sequences, including allelic 
5 sequences, can be obtained in this manner. For high stringency hybridization 
conditions, various parameters can be altered to increase the stringency of 
hybridization, such as by increasing the temperature of incubation with the labeled 
probe. Preferably, for greater flexibility in experimental design, the probe can be 
hybridized at a lower temperature, such as, for example, room temperature and the 

1 0 stringency can then be modified by altering the salt concentration and temperature of 
the wash solutions. For high stringency a wash temperature of greater than or equal to 
42°C can be used, such as, for example, 68^C, in a wash buffer having a salt 
concentration less than 3X SSC, such as, for example, O.IX SSC. In some cases, 
TMACL can also be used, particulariy for polynucleotides rich in G-C base pairs in 

15 order to decrease non-specific binding, A lower stringency wash can be used to 
hybridize polynucleotides with lower identities or polynucleotides that are less than 
60 base pairs in length. For a low stringency wash, temperatures of less than or equal 
to 42° can be used in a wash buffer having a salt concentration of greater than or 
equal to 2X SSC. 

20 The invention includes methods for amplification of target nucleic acids, such 

as the polymerase chain reaction ("PCR") technique. The PCR technique can be 
applied to identify related sequences in the genomes of various organisms and to 

detect nucleotide sequences in suspected samples, using oligonucleotide primers 
spaced apart fi-om each other and based on the genetic sequence set forth herein. The 

25 primers are complementary to opposite strands of a double-stranded DNA molecule 
and are typically separated by from about 50 to 450 nucleotides or more (usually not 
more than 2000 nucleotides). This method entails preparing the specific 
oligonucleotide primers followed by repeated cycles of target DNA denaturation, 
primer binding, and extension with a DNA polymerase to obtain DNA fi-agments of 

30 the expected length based on the primer spacing. Extension products generated fi-om 
one primer serve as additional target sequences for the other primer. The degree of 
amplification of a target sequence is controlled by the number of cycles that are 
performed and is theoretically calculated by the simple formula 2n where n is the 
number of cycles. Given that the average efficiency per cycle ranges fi-om about 65% 
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to 85%, 25 cycles produce from 0.3 to 4.8 million copies of the target sequence. The 
PGR method is described in a number of publications, including Saiki et al., Science 
(1985) 230:1350-1354; Saiki et al.. Nature (1986) 324:163-166; and Scharf et al.. 
Science (1986) 233:1076-1078. Also see U.S. Patent Nos. 4,683,194; 4,683,195; and 
5 4,683,202, the text of each patent is herein incorporated by reference. Additional 
methods for PGR amplification are described in: PGR Technology: Principles and 
Applications for DNA Amplification ed. HA Erlich, Freeman Press, New York, NY 
(1992); PCR Protocols: A Guide to Methods and Applications, eds. Innis, Gelfland, 
Snisky, and White, Academic Press, San Diego, GA (1990); Mattila et al. (1991) 

10 Nucleic Acids Res, 19: 4967; Eckert, K.A. and Kunkel, T.A. (1991) PCR Methods and 
Applications 7:17, and; PCR, eds. McPherson, Quirkes, and Taylor, IRL Press, 
Oxford, all of which are incorporated herein by reference. 

In yet another embodiment, an antisense polynucleotide can be administered to 
a mammal to treat or prevent a medical condition involving pathologic and/or 

1 5 hypoxia-driven angiogenesis. The antisense oligonucleotides of the invention can be 
synthesized by any of the known chemical oligonucleotide synthesis methods. Such 
methods are generally described, for example, in Winnacker, From Genes to Glones: 
Introduction to Gene Technology. VGH Verlagsgesellschaft mbH (H., Ibelgaufts 
trans. 1987). Any of the known methods of oligonucleotide synthesis can be utilized 

20 in preparing the instant antisense oligonucleotides. The antisense oligonucleotides are 
most advantageously prepared by utilizing any of the commercially available, 
automated nucleic acid synthesizers. The device utilized to prepare the 
oligonucleotides described herein, the Applied Biosystems 3 SOB DNA Synthesizer, 
utilizes -cyanoethylphosphoramidite chemistry. Antiisense oligonucleotides 

25 hybridizable with any portion of the mRNA transcript can be prepared by the 

oHgonucleotide synthesis methods known to those skilled in the art. While any length 
oligonucleotide can be utilized in the practice of the invention, sequences shorter than 
12 bases may be less specific in hybridizing to the target GBS toxin receptor mRNA, 
and may be more easily destroyed by enzymatic digestion. Hence, oligonucleotides 

30 having 12 or more nucleotides are preferred. Sequences longer than 18 to 21 
nucleotides may be somewhat less effective in inhibiting GBS toxin receptor 
translation because of decreased uptake by the target cell. Thus, oligomers of 12-21 
nucleotides are most preferred in the practice of the present invention, particularly 
oligomers of 12-18 nucleotides. Oligonucleotides complementary to and hybridizable 
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with any portion of the GBS toxin receptor mRNA transcript are, in principle, 
effective for inhibiting translation of the transcript, and capable of inducing the effects 
herein described. Translation is most effectively inhibited by blocking the mRNA at a 
sife at or near the initiation codon. Thus, oligonucleotides complementary to the 5' 
5 region of the GBS toxin receptor mRNA transcript are preferred. Secondary or 
tertiary structure which might interfere with hybridization is minimal in this region. 
Moreover, sequences that are too distant in the 3' direction from the initiation site can 
be less effective in hybridizing the mRNA transcripts because of a "read-through" 
phenomenon whereby the ribosome is postulated to unravel the antisense/sense 

10 duplex to permit translation of the message, (see, e.g. Shakin, J. Biochemistry 261, 
16018 (1986)). The antisense oligonucleotide is preferably directed to a site at or near 
the ATG initiation codon for protein synthesis. Oligonucleotides complementary to a 
portion of the GBS toxin receptor mRNA including the initiation codon are preferred. 
While antisense oligomers complementary to the 5' region of the GBS toxin receptor 

15 transcript are preferred, particularly the region including the initiation codon, it should 
be appreciated that useful antisense oligomers are not limited to those complementary 
to the sequences found in the translated portion of the mRNA transcript, but also 
includes oligomers complementary to nucleotide sequences contained in, or extending 
into, the 5' and 3' untranslated regions. Antisense nucleotides or antisense expression 

20 constructs can find use to treat or prevent diseases associated with pathologic or 

hypoxia-driven angiogenesis and neovascularization, as inappropriate expression of 
GBS toxin receptor results in hyperproliferation of endothelial cells. 

In one embodiment, the polynucleotides of the invention can exist in linear 
form. In another embodiment, the polynucleotides can exist in circular form as part of 

25 a plasmid. 

In yet another embodiment, a probe or PGR primer comprises a group of 
polynucleotide species containing different degenerate codons at various positions, 
which polynucleotides encode, or are complementary to sequences encoding, a GBS 
toxin receptor in whole or in part. Such polynucleotides can be useful for isolating 
30 nucleic acid sequences encoding polypeptides having at least about 85% identity to 
the amino acid sequence of sheep or human GBS toxin receptor, such as, for example, 
GBS toxin receptors of other organisms. Typically, such polynucleotides are 
synthesized chemically as described above by programming a synthesizer to 
incorporate a particular combination of nucleic acid residues at a certain position. 
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Table 2 
Base Codes 



Symbol Meaning 

A A; adenine 

C C; cytosine 

G G; guanine 

T T; thymine 

U U; uracil 

M AorC 

R AorG 

W AorT/U 

S C or G 

Y C or T/U 
K GorT/U 

V A or C or G; not T/U 
H A or C or T/U; not G 
D A or GorT/U; note 
B Cor GorT/U; not A 
N A or C or G or T/U 
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Polypeptides 

Another aspect of the invention provides polypeptides comprising (1) the full 
length GBS toxin receptor protein or a naturally occurring allelic variant thereof, (2) 
fragments of at least 3 amino acids of the amino acid sequence of SEQ ID NO: 2, 4, 8, 
5 1 0 or 1 2, and (3) a GBS toxin receptor protein, polypeptide, or polypeptide fragment 
having an amino acid identity in the range of about 80% to 100% to the amino acid 
sequence of a corresponding region of SEQ ID NO: 2, 4 or 8. Preferred fragments of 
the amino acid sequence of SEQ ID NO: 2, 4, 8, 10 or 12, are at least 5, 6, 7, 8 or 9 
amino acids in length and are immunologically reactive, i.e., immunogenic. More 
1 0 preferred are fragments at least 25 amino acids in length and fragments comprising 
the amino acid sequence of residues 181 to 419 of SEQ ID NO: 2 or residues 1 to 240 
of SEQ ID NO: 4. Most preferred are fragments that can bind GBS toxin. Preferably, 
the GBS toxin receptor protein, polypeptide, or polypeptide fragment has an amino 
acid identity to the amino acid sequence of a corresponding region of SEQ ID NO: 2, 

15 4 or 8 of at least about 86%, more preferably at least about 95% identity, even more 
preferably at least about 99% identity up to having one amino acid difference, and 
most preferably 100% identity. Preferred polypeptides have at least about 89% 
identity, more preferably at least about 95% identity, even more preferably at least 
about 99% identity up to having one amino acid difference, and most preferably 100% 

20 identity to the amino acid sequence of residues 181 to 419 of SEQ ID NO: 2, residues 
1 to 495 of SEQ ID NO: 4. Preferably, a full length GBS toxin receptor protein 
comprises the amino acid sequence of residues 1 to 495 of SEQ ID NO: 2, residues 1 
to 495 of SEQ ID NO: 4, or residues 1 to 536 of SEQ ID NO: 8, or an allelic variant 
thereof. The polypeptides of the invention can include amino acids in addition to the 

25 GBS toxin receptor protein, polypeptide, or polypeptide fragment. Such polypeptides 
typically comprise a heterologous polypeptide joined to a second polypeptide derived, 
as described above, from a GBS toxin receptor. Preferably the additional amino acids 
are covalently linked to the amino-terminal or carijoxy-teiminal tenninus of the GBS 
toxin receptor protein, polypeptide, or polypeptide fragment. 

30 Fragments or analogs of GBS toxin receptor can be prepared by those of 

ordinary skill in the art. Preferred amino- and carboxy-termini of fragments or 
analogs occur near boundaries of functional domains. For example, such functional 
domains include domains conferring the property of induction of an inflammatory 
response upon binding of GBS toxin to the GBS toxin receptor. GBS toxin mediates 
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the binding and opsonization by C3 of endothelial cells that express the GBS toxin 
receptor. Such domains can comprise the binding site for GBS toxin, in whole or in 
part, or domains otherwise essential for GBS toxin receptor structure and/or function. 
Preferably, computerized comparison methods are used to identify sequence motifs or 
5 predicted protein conformation domains that occur in other proteins of known 
structure and/or function. Methods to identify protein sequences that fold into a 
known three-dimensional structure are known (Bowie et al. ( 1 991) Science 253: 1 64). 
Computerized prediction methods, such as, for example, a hydropathy profile as 
provided by the "Soap" program in PC/GENE can be employed to identify putative 
1 0 structural and functional domains. Using the method of Klein, Kanehisa and DeLise, 
Biochim Biophys Acta (1985) 815:468-476, the inventors have classified a sheep 
GBS toxin receptor, SP55, as an integral protein with seven transmembrane segments 
predicted. Such a protein is also known colloquially in the art as a *7-spanner". The 
predicted segments are set forth below in Table 3. 

15 Tables 



Predicted Transmembrane Domains of SPSS 





Inner Boundaries 


Outer Boundaries 


Segment 
Sequence 


P:I odds* 


No. 


From 


To 


From 


To 


1 


232 


248 


226 


252 


FFGIVGIIWFILWICLV 
(232-248 ofSEQIDNo.4) 


2.589323E-05 


2 


369 


385 


365 


389 


LIGMIGPAIFLVAAGFI 
(369-385 ofSEQ ID No. 4) 


1.00731 lE-03 


3 


458 


474 


456 


479 


TVFCIAAAINVFGAIFF 
(458-474ofSEQIDNo.4) 


2.482542E-03 


4 


137 


153 


135 


157 


LLLGFGIFATAIFTLFT 
(137-153 ofSEQ IDNo.4) 


7.564906E-03 


5 


42 


58 


42 


58 


LAFLSFFGFFVLYSLRV 
(42-58ofSEQlDNo.4) 


8.236557E-02 


6 


328 


344 


328 


345 


GFLSAVPYLGCWLCMI 
L (328-344 ofSEQIDNo. 
4) 


.1925022 


7 

■ * 


390 


406 


390 


407 


SLAVAFLTISTTLGGFC 
(390-406 ofSEQIDNo. 4) 


.8064944 



* Relates hydrophobicity of integral sequence to the hydrophobicity of the peripheral 
sequence. An integral sequence with a higher hydrophobicity number is more likely 
to be part of a transmembrane domain. 



A computerized alignment of the amino acid sequences of GBS toxin receptor 
in various organisms provides further guidance in preparing preferred fragments. See, 
for example, Table 4 which compares the amino acid sequence of residues 42 to 536 
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of a human GBS toxin receptor (HP59) (residues 42 to 536 of SEQ ID NO: 8) and a 
sheep GBS toxin receptor (SP55). 

Table 4 

Alignment of Human and Sheep GBS Toxin Receptor Amino Acid 
5 Sequences 

SP55 MKSPVSDLAPSDGEEGSDRTPLLQRAPRAEPAPVCCSARYNLAFLSFFGF 50 

I Ml III MM llllll MM! IIIIMIIIMi I III! 

HP55 MRSPVRDLARNDGEESTDRTPLLPGAPRAEAAPVCCSARYNLAILAFFGF 50 
SPSS FVLYSLRVNLSVALVDMVDSNTTAKDNRTSYECAEHSAPIKVLHNQTGKK 100 

I I M llllll III III MM Mill I llllllll i Mill I 

HPS5 FIVYALRVNLSVALVDMVDSNTTLEDNRTSKACPEHSAPIKVHHNQTGKK 100 

SPSS YRWDAETQGWILGSFFYGYIITQIPGGYVASRSGGKLLLGFGIFATAIFT 150 

I IIMIMIIIIIIIIIIIIIIIIIIIIM III llllll II I 

HP55 YQWDAETQGWILGSFFYGYIITQIPGGYVASKIGGKMLLGFGILGTAVLT ISO 

SPSS LFTPLAADFGVGALVALRALEGLGEGVTYPAMHAMWSSWAPPLERSKLLS 200 

INI 11! Ill I illlllllMII Mill llllllllll llllll 

HPS 5 LFTPIAADLGVGPLIVLRALEGLGEGVTFPAMHAMWSSWAPPLERSKLLS 200 
SPSS ISYAGAQLGTWSLPLSGVICYYMNWTYVFYFFGIVGIIWFILWICLVSD 250 

„„„ II I MM MM llllll Mill Mil llllll II II Ml MM 

HPS5 ISYAGAQLGTVISLPLSGIICYYMNWTYVFYFFGTIGIFWFLLWIWLVSD 250 
SPSS TPETHKTITPYEKEYILSSLKNQLSSQKSVPWIPMLKSLPLWAIWAHFS 300 

II III IlillllMI MMMIIIM I MMMMMIMM 

HP5S TPQKHKRISHYEKEYILSSLRNQLSSQKSVPWVPILKSLPLWAIWAHFS 300 
SPSS YNWTFYTLLTLLPTYMKEVLRFNIQENGFLSAVPYLGCWLCMILSGQAAD 350 

MIMIMMMMMM MM MIIIM MM IIIIIIIIIMI 

HP55 YNWTFYTLLTLLPTYMKEILRFNVQENGFLSSLPYLGSWLCMILSGQAAD 350 
SPSS NLRARWNFSTLWVRRVFSLIGMIGPAIFLVAAGFIGCDYSLAVAFLTIST 400 

■^^^ -nil IMIII lll-lllll|.|MI.MIIIIM.||JIMMIMI^ 

HPSS NLRAKWNFSTLCVRRIFSLIGMIGPAVFLVAAGFIGCDYSIiAVAFLTIST 400 
SPSS TLGGFCSSGFSINHLDIAPSYAGILiiGITNTFATIPGMIGPIIARSLTPE 450 

II MIIIIIIIMII II llllll MMMMMIMM II II MM 

HPSS TLGGFCSSGFSINHLDIAPSYAGILLGITNTFATIPGMVGPVIAKSLTPD 450 
SPSS NTIGEWQTVFCIAAAINVFGAIFFTLFAKGEVQNWAISDHQGHRN 4 95 

II IIIIIM II Ml I II II Mill II II II III I II Ml 

HP55 NTVGEWQTVFYIAAAINVFGAIFFTLFAKGEVQNWALNDHHGHRH 4 95 

HP55-SEQIDNO:2 
SP55-SEQroNO:4 

Thus, the foregoing examples demonstrate that those of skill in the art can recognize 
sequence motifs and structural conformations that may be used to define structural 
1 0 and functional domains in a GBS toxin receptor sequence. 

22. 
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Although one class of preferred embodiments are fragments having amino- 
and/or carboxy-termini corresponding to amino acid positions near functional 
domains borders, alternative fragments may be prepared. The choice of the amino- 
and carboxy-termini of such fragments rests with the discretion of the practitioner and 
5 will be made based on experimental considerations, such as ease of construction, 
stability to proteolysis, thermal stability, immunological reactivity, amino- or 
carboxyl-temiinal residue modification, or other considerations. Polypeptide 
fragments usually contain at least nine amino acids and can contain any number of 
amino acids provided that the peptide fragment is at least about 80% identical to the 
1 0 corresponding fragment of SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID N0:8. The 

human GBS toxin receptor has 41 additional amino acids on the N-terminus compared 
to the sheep GBS toxin receptor (compare SEQ ID NO:4 and SEQ ID N0:8). 
Analogs can comprise additions or deletions of some or all of those 41 N-terminal 
amino acids. N-terminal and C-terminal additions usefiil, e.g., for purification and/or 
15 antibody recognition are also contemplated. Examples include histidine tags, a FLAG 
(phenylalanine, leucine, alanine, guanine) epitope, fusion partners such as glutathione 
S transferase, chloramphenicol acetyltransferase (CAT), luciferase, p-galactosidase, 
and the like. Deletions of unconserved amino acids are also contemplated, provided 
that the structural integrity and/or binding properties of the GBS toxin receptor are not 
20 substantially compromised. 

Analogs can also comprise amino acid substitutions, preferably conservative 
substitutions. Also preferred are conservative and/or non-conservative substitutions 
in regions having less shared identity among various species. For example, a variant 
of a GBS toxin receptor can comprise conservative and/dr non-conservative - 
25 substitutions of amino acids corresponding to residues 2, 6, 10, 1 1, 16, 17, 24, 31, 44, 
46, 52, 53, 55, 74, 75, 81, 82, 84, 93, 102, 132, 133, 137, 144, 145, 148, 149, 155, 
159, 163, 165, 166, 179, 212, 219, 235, 236, 239, 242, 246, 253, 254, 257, 259, 260, 
271, 283, 285, 319, 324, 332, 333, 338, 355, 362, 366, 377, 439, 442, 445, 450, 453, 
461, 487, 488, 491 and 495 of SEQ ID N0:4. Preferably the substitution is an amino 
30 acid present in the corresponding position of SEQ ID NO:4 or SEQ ID NO:8. For 
example, referring to the alignment plot in Table 4, the amino acid corresponding to 
position 152 of SEQ ID N0:4 can be arginine (R), glutamine (Q), or a conservative or 
non-conservative substitution of R or Q, and preferably is R or Q. Such regions can 
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be identified by amino acid sequence alignment plots, such as that shown in Table 4. 
Preferred amino acid substitutions are those which: (1) reduce susceptibility to 
proteolysis, (2) reduce susceptibility to oxidation, (3) alter binding affinity for GBS 
toxin, and (4) confer or modify other physicochemical or functional properties of such 
5 analogs. Analogs can include various mutations of a sequence other than the 

naturally-occurring peptide sequence, such as, for example, single or multiple amino 
acid substitutions. 

A conservative amino acid substitution should generally not substantially 
change the structural characteristics of the parent sequence (e.g., a replacement amino 
1 0 acid should not tend to break a helix that occurs in the parent sequence, disrupt 
disulfide bonds or disrupt other types of secondary structure that characterizes the 
parent sequence). Examples of art-recognized polypeptide secondary and tertiary 
structures are described in Proteins, Structures and Molecular Principles, (1984) 
Creighton (ed.), W.H. Freeman and Company, New York; Introduction to Protein 

1 5 Structure, ( 1 99 1 ), C. Branden and J. Tooze, Garland Publishing, New York, NY; and 
Thornton et al. (1991) Nature 354: 105 (which are incorporated herein by reference). 
A conservative substitution is a "replacement of an amino acid in a polypeptide by 
one with similar characteristics." (McGraw-Hill Dictionary of Scientific and 
Technical Terms, Fifth Edition, 1994, Sybil P. Parker, Editor in Chief). The structure 

20 and characteristics of naturally occurring amino acids has long been known in the art 
(Biochemistry, Second Edition, Albert L. Lehninger, 1975, pages 71-76) For 
example, amino acids which are similar by virtue of their hydrophobic R groups are 
alanine, valine, leucine, isoleucine, proline, phenylalanine, tryptophan, and 
methionine. Alanine, valine, leucine, and isoleucine are siihilS- by virtue of their 

25 aliphatic R groups. Phenylalanine and tryptophan are similar by virtue of their 
aromatic R groups. Glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine are similar by virtue of their uncharged polar R groups. Glycine and 
alanine are similar by virtue of their small size. Serine and threonine are similar by 
virtue of a hydroxyl in their R group. Asparagine and glutamine differ by only one 

30 methyl group. Similarly, aspartic acid and glutamic acid differ by only one methyl 
group, and they are similar by virtue of their acidic R groups. Lysine, arginine, and 
histidine are similar by virtue of their basic R groups. In addition, lysine and arginine 
are similar by virtue of the amino groups on the end of the aliphatic chain in their R 
groups. Tyrosine and phenylalanine are similar by virtue of their aromatic groups. 

24. 
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Amino substitutions commonly made in the art include a substitution of valine for 
leucine or isoleucine, alanine for glycine, serine for threonine, asparagine for 
glutamine, aspartic acid for glutamic acid, and lysine for arginine, tyrosine for 
phenylalanine, and vice versa. 

Typically, one skilled in the art would generally refrain from changing amino 
acids that are conserved among the various GBS toxin receptors, but a conservative 
substitution might reasonably be made. For example, Table 4 guides one skilled in 
the art to avoid substitutions, particularly nonconservative substitutions, for amino 
acids corresponding to residues 1, 3-5, 7-9, 12-15, 18-23, 26-30, 32-43, 45, 47-51, 54, 
56-73, 76-80, 83, 85-92, 94-101, 103-131, 134-136, 138-143, 146-147, 150-154, 156- 
158, 160-162, 164, 167-178, 180-211,213-218,220-234, 237-238, 240-241,243-245, 
247-252, 255-256, 258, 261-270, 272-282, 284, 286-318, 320-323, 325-331, 334-337, 
339-354, 356-361, 363-365, 367-376, 378-438, 440-441, 443-444, 446-449, 451-452, 
454-460, 462-486, 489-490 and 492-494 of SEQ ID N0:4, which are conserved 
among the GBS toxin receptors shovm in Table 4. 

Tables 5 and 6 describe sequences within HP59 and SP55, respectively, that 
match predicted amidation, N-glycosylation, cAMP-phosphorylation, CK2- 
phosphosylation, myristylation (addition of unsaturated fatty acid molecules), and 
PKC-phosphosylation sites (Omega 1 . 1 sequence analysis program). The information 
contained in these tables provides guidance to one skilled in the art for designing GBS 
toxin receptor variants and fragments. When designing polypeptide variants, for 
example, one may decide to avoid substitutions in some or all of these regions. When 
designing polypeptide fragments other than immunogenic polypeptide fragments, for 
example, one may opt to include some or all of these regions. 



Tables 
Putative RecognitioD Sites 
in HP59 



Table 6 
Putative Recognition Sites 
in SP55 



Site 



Seq. ID 
NO: 8 
Residues: 



Sequence 



Site 



Seq. ID 
NO: 4 
Residues: 



Sequence 



AMIDATION 23-26 SGRR 

AMIDATION 138-141 TGKK 

ASN^GLYCOSYLATION 100-103 NLSV 

ASN_GLYCOSYLATION 112-115 NTTL 

ASN_GLYCOSYLATION 118-12! NRTS 

ASN_GLYCOSYLATION 136-139 NQTG 

ASN_^GLYCOSYLATION 266-269 UWTY 

ASN_GLYCOSYLATION 343-346 NVn*F 

ASN_GLYCOSYLATION 398-401 NFST 

CAMP PHOSPHO SrrE 297-300 KRIS 

CK2_PHOSPHO SITE 113-116 TTLE 



AMIDATION 97-100 TGKK 

ASN GLYCOSYLATION 59-62 NLSV 

ASN GLYCOSYLATION 71-74 NTTA 

ASN_GLYCOSYLATION 77-80 NRTS 

ASN^GLYCOSYLATION 95-98 NQTG 

ASN.GLYCOSYLATION 225-228 NWTY 

ASN_GLYCOSYLATION 302-305 NWTF 

ASN GLYCOSYLATION 357-360 NFST 

CK2_PH0SPH0 SITE 11-14 SDGE 

CK2_PH0SPH0_SITE 73-76 TAKD 

CK2_PH0SPH0_SITE 79-82 TSYE 
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Table 5 
Putative Recognition Sites 
in HP59 



Site 



Seq. ID 
NO: 8 
Residues: 



Sequence 



CK2_^HOSPHO_SITE H4.I17 TLED 

CK2_PH0SPH0_SITE 300-303 SHYE 

CK2_PH0SPH0_SITE 493-496 TVGE 

MYRISTYL 66-71 GAPRAE 

MYRISTYL 167-172 GGYVAS 

MYRISTYL 183-188 GILGTA 

MYRISTYL 213-218 GLGEGV 

MYRISTYL 246-251 GAQLGT 

MYRISTYL 250-255 GTVISL 

MYRISTYL 378-383 GSWLCM 

MYRISTYL 427-432 GCDYSL 

MYRISTYL 444-449 GGFCSS 

MYRISTYL 464-469 GILLGI 

MYRJSTYL 468-473 GITNTF 

PKC_PHOSPHO_SITE 23-25 SGR 

PKC_PHOSPHO_SITE 58-60 TDR 

PKC_PHOSPHO_SITE 78-80 SAR 

PKC_PHOSPHO_SITE 1 20- 1 22 TSK 

PKC_PHOSPHO_SITE 138-140 TGK 

PKC_PHOSPHO_SITE 310-312 SLR 

PKC_PHOSPHO_SITE 3 1 7-320 SQK 



Table 6 
Putative Recognition Sites 
in SPSS 



Site 



Seq. ID 
NO: 4 
Residues: 



Sequence 



CK2^PH0SPH0_SITE 259-262 

CK2_PHOSPHO_SITE 452-455 

MYRISTYL 126-131 

MYRISTYL 142-147 

MYRISTYL 162-167 

MYRISTYL 172-177 

MYRISTYL 205-210 

MYRISTYL 209-214 

MYRISTYL 337-342 

MYRISTYL 386-391 

MYRISTYL 403-408 

myristyl 423-428 

myristyl 427-432 

pkc_phospho_site 17-19 

pkc_ph0sph0_s1te 37-39 

pkc_phospho_site 55-57 

pk:c_phospho_site 73-75 

pkc_phospho_site 97-99 

pkc_ph0sph0_s1te 254-256 

pkc_phospho_site 269-27 1 

pkc_phospho_site 276-278 



TPYE 

TIGE 
GGYVAS 
G I FATA 
GALVAL 
GLGEGV 
GAQLGT 
GTVVSL 
GCWLCM 
GCDYSL 
GGFCSS 
GILLGI 
GITNTF 

SDR 

SAR 

SLR 

TAK 

TGK 

THK 

SLK 

SQK 
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In light of the foregoing, preferred polypeptides comprise an amino acid 
sequence of the formula: 

AAl-AAn-AAm 

wherein: 

AAl is absent or is M; 

AAn is a contiguous chain of 0 to 100 amino acids, preferably of 0 or 41 
amino acids, even more preferably of residues 2-42 of SEQ ID N0:8; and 

AAm is a contiguous chain of 494 amino acids comprising AA43 through 
AA536, wherein: 

_ (1) each of AA43, AA47, AA51, AA52, AA57, AA58, AA65, 
AA66, AA72, AA85, AA87, AA93, AA94, AA96, AA115, AAl 16 
AA122, AA123, AA125, AA134, AA143, AA173, AA174, AA178, 
AAl 85, AA186, AA189, AA190, AA196, AA200, AA204, AA206 
AA207, AA220, AA253, AA260, AA276, AA277, AA280, AA283! 
AA287, AA294, AA295, AA298, AA300, AA301, AA312, AA324! 
AA326, AA360, AA365, AA373, AA374, AA379, AA396, AA403 
AA407, AA418, AA480, AA483, AA486, AA491, AA494, AA502! 
AA528, AA529, AA532 and AA536 is an essential amino acid or a 
modified amino acid and preferably is an amino acid residue 
corresponding to: 

(a) residue 43, 47, 51, 52, 57, 58, 65, 66, 72, 85, 87, 93 
94, 96, 115, 116, 122, 123, 125, 134, 143. 173, 174, 178, 185, 
186, 189, 190, 196, 200, 204, 206, 207, 220, 253, 260, 276 
277, 280, 283, 287, 294, 295, 298, 300, 301, 312, 324, 326^ 
360, 365, 373, 374, 379, 396, 403, 407, 418, 480, 483, 486 



26. 
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491, 494, 502, 528, 529, 532 and 536, respectively, of SEO ID 
N0:8; 

(b) residue 2, 6, 1 0, 1 1 , 1 6, 1 7, 24, 25, 3 1 , 44, 46, 52, 
53, 55, 74, 75. 81, 82. 84, 93, 102, 132, 133, 137, 144 145 
5 148, 149, 155, 159, 163, 165, 166, 179, 212, 219. 235,' 236' 

239, 242, 246, 253, 254, 257, 259, 260, 271, 283, 285, 319' 
324, 332, 333, 338, 355, 362, 366, 377, 439, 442, 445,' 45o[ 
453, 461, 487, 488, 491 and 495, respectively of SEQ ID N0:4; 
or 

10 (c) a conservative substitution thereof; 

(2) each of AA44-AA46, AA48-AA50, AA53-AA56, AA59- 
AA64, AA67-AA71, AA73-AA84, AA86, AA88-AA92, AA95 
AA97-AA114, AA117-AA121, AA124, AA126-AA133', AA13'5- 
AA142, AA144-AA172, AA175-AA177, AA179-AA184, AA187- 

15 AA188, AA191-AA195, AA197-AA199, AA201-AA203, AA205 

AA208-AA219, AA221-AA252, AA254-AA259, AA261-AA275 ' 
AA278-AA279, AA281-AA282, AA284-AA286, AA288-AA293i 
AA296-AA297, AA299, AA302-AA311, AA313-AA323, AA325 
AA327.AA359, AA361-AA364, AA366-AA372, AA375-AA378 ' 

20 AA380-AA395, AA397-AA402, AA404-AA406, AA408-AA417,' 

AA41 9-AA478, AA48 1 -AA482, AA484-AA485, AA487-AA49o] 
AA492-AA493, AA495-AA501, AA503-AA527, AA530-AA53rand 
AA533-AA535 is 

(a) residue 44-46. 48-50, 53-56, 59-64, 67-7 1 , 73-84 
25 86, 88-92, 95, 97-1 14, 117-121, 124, 126-133, 135-142, 144- 

172, 175-177, 179-184, 187-188. 191-195, 197-199, 201-203, 
205, 208-219, 221-252, 254-259, 261-275, 278-279, 281-282, 
284-286, 288-293, 296-297, 299, 302-31 1, 313-323, 325, 327- 
359, 361-364, 366-372, 375-378. 380-395, 397-402, 404-406, 
30 408-417, 419-478, 481-482, 484-485, 487-490, 492-493, 495- 

501, 503-527, 530-531 and 533-535. respectively, of SEQ ID 
N0:8; or 

(b) a conservative substitutions thereof; and 
(3) AA315 through AA367 are optionally absent. 

35 Preferred polypeptides comprise the amino acid sequence of SEQ ID N0:4, 

SEQ ID NO:8 or an amino acid sequence which varies from that sequence only at the 
specific residues which are not conserved between the sheep GBS toxin receptor 
(SEQ ID NO:4) and the human GBS toxin receptor (SEQ ID NO:8). Of those 
variations, the most preferred variations are those resulting in a polypeptide encoded 

40 by SEQ ID NO: 1 1 . Even more preferred variations are those amino acids in the 
corresponding positions of the amino acid sequence of SEQ ID N0:4. Particularly 
preferred are polypeptides comprising an amino acid sequence that differs from SEQ 
ID N0:2, SEQ ID N0:4 or SEQ ID N0:8 at no more than about 20% of the amino 
acid residues, with increasing preference for no more than about 10%, 5%, 1%, with 

45 one to zero amino acid differences being most preferred. 

27. 
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Besides targeting specific amino acids for change, analogs of GBS toxin 
receptor can also be prepared by techniques involving activity selection, such as, for 
example, phage display, directed evolution, DNA shuffling, and homologous in 
procaryotes or eucaryotes of genes from different species, as described in part in U.S. 
5 Patent Nos. 5,605,793; 5,830,721; 5,811,238; 5,837,458; 5,093,257; 5,223,409; 
5,403,484; 5,571,698; and 5,837,500, which are incorporated herein by reference. 

Any variant or fragment of the human and sheep GBS toxin receptors 
described herein can be tested for the requisite activity by determining whether the 
variant or fragment can bind GBS toxin. 
10 These polypeptides provide reagents useful in drug discovery and purification 

and can be used in various in vitro assays, preferably when expressed on the surface 
of a cell, e.g., a stable transfected cell. For example, assays such as binding assays 
can be used to screen test compounds, including polysaccharides and other 
compounds, for their ability to bind the GBS toxin receptor. Assays can identify 
1 5 potential drug candidates that block GBS toxin binding to the GBS toxin receptor. 
Such drugs are usefiil for preventing and/or treating early onset disease in neonatal 
humans. Some polypeptides can be used to competitively inhibit binding GBS toxin 
to a GBS toxin receptor. 
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The polypeptides of the invention can be used to affinity purify GBS toxin, a 
GBS toxin chimeric compound, and other polysaccharides or compounds which can 
bind the GBS toxin receptor. 

The polypeptides can also be used to develop a method of targeting a 
5 cytotoxic agent for delivery to a cell that expresses a GBS toxin receptor. For 
example, a cytotoxic agent can be coupled to a molecule that binds a GBS toxin 
receptor for selective delivery to the neovasculature of a growing tumor. Such a 
delivery system would permit a highly concentrated, localized attack on a growing 
tumor, while minimizing the adverse systemic side effects encountered with most 

10 chemotherapeutics. In one instance, the cytotoxic agent can be GBS toxin, which, 
upon binding to GBS toxin receptor, induces an inflammatory response as described 
in Hellerqvist et al.. Angiogenesis: Molecular Biolog v. CHnical Aspects . Edited by 
M.E. Maragoudakis et al.. Plenum Press, New York 1994, pp. 265-269. In a similar 
manner, selective delivery of a therapeutic agent to a cell that expresses a GBS toxin 

1 5 receptor could be used advantageously to treat tumors, rheumatoid arthritis or neural 
injury, or to facilitate wound healing. 

The polypeptides of the invention can also be used to screen for and/or design 
a GBS toxin mimetic with improved therapeutic properties, such as, for example, 
improved ability to inhibit hypoxia-induced neovascularization or angiogenesis. Such 

20 mimetics are useful in the treatment and prevention of conditions resulting from 
hypoxia-induced neovascularization or angiogenesis, such as, for example, tumor 
growth, scarring during wound healing, gliosis during repair of neural injury, 
reperfiision injury, restenosis, rheumatoid arthritis, psoriasis, other chronic 
inflammatory diseases charactOTzed by angiogenesis, etc. Therapeutic properties can 

25 be improved by enhancing biological stability, affinity for the GBS toxin receptor, 
complement binding activity, reducing antigenicity, etc. 

The polypeptides of the invention can also be used to generate antibodies for 
various therapeutic and research purposes. The polypeptides of the invention can be 
used to inmiunize rabbits, mice, goats, chickens, or other animals known in the art to 

30 be amenable to such immimization. Monoclonal antibodies are generally preferred 

but polyclonal antibodies can also be used, provided that detection of binding of the 

GBS toxin receptor antibody to the GBS toxin receptor is possible. The production of 

non-human monoclonal antibodies, e.g., murine, is well known (see, e.g., Harlow et 

al., Antibodies A Laboratory Manual, Cold Spring Harbor Press, pp. 1 39-240, 1 989, 

29. 
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incorporated herein by reference). As it may be difficult to generate human 
monoclonal antibodies to a human receptor or binding domain polypeptide, it may be 
desirable to transfer antigen binding regions of non-human monoclonal antibodies, 
e.g. the F(ab')2 or hypervariable regions or murine monoclonal antibodies, to human 
constant regions (Fc) or framework regions by recombinant DNA techniques to 
produce substantially human molecules. Such methods are generally known and are 
described in, e.g., U.S. Pat. Nos. 4,816,397 and 4,946,778, and EP publications 
1 73,494 and 239,400. Alternatively, one may isolate DNA sequences which code for 
a human monoclonal antibody or portions thereof that specifically bind to the receptor 
protein by screening a DNA library from human B cells according to the general 
protocol outlined in WO 90/14430, and then cloning and amplifying the sequences 
which encode the antibody (or binding fragment) of the desired specificity. 

Usually, polypeptides used for producing antibodies are the full-length 
receptor or receptor fragments designed from putative extracellular domains identified 
by a variety of methods known in the art, including computer programs which predict 
secondary and tertiary structure of a polypeptide based upon its primary amino acid 
sequence. Another method for designing antigenic peptides utilizes computer 
programs that predict the high points of hydrophilicity within a particular primary 
amino acid sequence. For example, using the method of Happ and Woods, Proc. 
NatL Acad, ScL USA (1981) 78:3824-3829, via the "Antigen" program in PC/GENE, 
the inventors identified 3 regions of high hydrophilicity, shown below in Table 7, and 
used the results to design antigenic peptides to be used in the preparation of antibodies 
against GBS toxin receptor (see Example 4). 



25 



Table 7 

High Points of Hydrophilicity in SPSS 



No. 


Ah 


Sequence 


1 


2.05 


Glu-Glu-Gly-Ser-Asp-Arg (14-19 of SEQ ID No. 2) 


2 


1.52 


Lys-Asp-Asn-Arg-Thr-Ser (75-80 of SHQ ID No. 2) 


3 


1.33 


Arg-Ala-Pro-Arg-Ala-Glu (25-30 of SEQ ID No. 2) 



30 



Antibodies that recognize various portions of the intact GBS toxin receptor 
can be used to further investigate structure and function of the receptor. The 
polypeptides of the invention can give rise to antibodies that recognize a variety of 
forais of GBS toxin receptor, including, but not limited to, intact GBS toxin receptor 

30. 
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expressed on a cell surface, denatured GBS toxin receptor or non-denatured GBS 
toxin receptor, and GBS toxin receptor purified away from cellular components or 
GBS toxin receptor contained in a cell lysate. GBS toxin receptor antibodies can be 
used to study species differences as well as GBS toxin receptor expression levels in 
various cell types. 
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Antibodies that recognize a portion or all of an extracellular domain are 
particularly useful as a diagnostic for the monitoring of tumor growth and metastasis, 
for the detection or identification of a chronic inflammatory condition, such as, for 
example, rheumatoid arthritis or psoriasis, and for the detection of other medical 
5 conditions arising due to hypoxia-driven angiogenesis, such as, for example, 
restenosis. Typically, such antibodies can be employed in a variety of standard 
research and diagnostic techniques, including, but not limited to, western blot, 
immunoprecipitation, ELISA, radioimmunoassay (RIA), BIACOR®, enzyme-linked- 
immunoassay (EIA), immunofluorescence, fluorescence activated cell sorting 
10 (FACS), and in vivo diagnostic imaging systems such as magnetic resonance imaging 
(MRI), nuclear magnetic resonance (NMR), computerized axial tomography (CAT) 
scan, and position emission tomography (PET), etc. 

In addition, antibodies that block the binding of GBS toxin to a GBS toxin 
receptor can be used for the treatment or prevention of early onset disease in a 
15 neonatal human. Such antibodies can directly or indirectly block the GBS toxin 
binding site on the GBS toxin receptor. 

In one embodiment, the GBS toxin receptor protein is naturally occurring and 
can be isolated from a cell extract by protein purification techniques known in the art, 
such as, for example, ion exchange colunm chromatography, high performance liquid 
20 chromatography (HPLC), reversed phase HPLC, or affinity chromatography using 
antibodies that recognize the GBS toxin receptor. 

Alternatively, the isolated proteins and polypeptides are expressed using 
polynucleotides encoding the polypeptide(s) of the invention in operative association 
with an appropriate control sequence including a promoter in an expression vector 
25 suitable for expression, preferably in a mammalian cell, and also in bacterial, insect, 
or yeast cells. 

Usually, the GBS toxin receptor polynucleotide or a firagment thereof can be 
expressed in a mammalian system. Such expression will usually depend on a 
mammalian promoter, which is any DNA sequence capable of binding mammalian 
30 RNA polymerase and initiating the downstream (3 ') transcription of a coding 
sequence (e.g. structural gene) into mRNA. Usually, a promoter will have a 
transcription initiation region which is usually placed proximal to the 5' end of the 
coding sequence. This transcription initiation region typically includes an RNA 
polymerase binding site and a transcription initiation site. 

32. 
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Vectors suitable for replication in mammalian cells are known in the art, and 
can include viral replicons, or sequences that ensure integration of the sequence 
encoding PAK65 into the host genome. Suitable vectors can include, for example, 
those derived from simian virus SV40, retroviruses, bovine papilloma virus, vaccinia 
virus, and adenovirus. 

A suitable vector, for example, is one derived from vaccinia viruses. In this 
case, the heterologous DNA is inserted into the vaccinia genome. Techniques for the 
insertion of foreign DNA into the vaccinia virus genome are known in the art, and 
utilize, for example, homologous recombination. The insertion of the heterologous 
DNA is generally into a gene which is non-essential in nature, for example, the 
thymidine kinase gene (tk), which also provides a selectable marker. Plasmid shuttle 
vectors that greatly facilitate the construction of recombinant viruses have been 
described (see, for example, Mackett et al. (1984); Chakrabarti et al. (1985); Moss 
(1987)). Expression of the heterologous polypeptide then occurs in cells or 
individuals which are immunized with the live recombinant vaccinia virus. 

Such suitable mammalian expression vectors usually contain one or more 
eukaryotic transcription units that are capable of facilitating expression in mammalian 
cells. The transcription unit is comprised of at least a promoter element to mediate 
transcription of foreign DNA sequences. Suitable promoters for mammalian cells are 
known in the art and include viral promoters such as those from simian virus 40 
(SV40) (Subramani et al., Mol Cell. Biol. 1:854-864, 1981), cytomegalovirus (CMV) 
(Boshart et al.. Cell 41 :521-530, 1985), Rous sarcoma virus (RSV), adenovirus 
(ADV) (Kaufinan and Sharp, Mol. Cell. Biol. 2:1304-1319, 1982), and bovine 
papilloma virus (BPV), as well as cellular promoters, such as a mouse _ . 
metallothionein-l promoter (U.S. Patent No. 4,579,821), a mouse VK promoter 
(Bergman et al., Proc. Natl. Acad. Sci. USA 81:7041-7045, 1993; Grant et al., Nuc. 
Acids Res. 15:5496, 1987), and a mouse VH promoter (Loh et al.. Cell 33:85-93, 
1983). 

The optional presence of an enhancer element (enhancer), combined with the 
promoter elements described herein, will typically increase expression levels. An 
enhancer is any regulatory DNA sequence that can stimulate transcription up to 1000- 
fold when linked to endogenous or heterologous promoters, with synthesis beginning 
at the normal mRNA start site. Enhancers are also active when they are placed 

upstream or downstream from the transcription initiation site, in either normal or 
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flipped orientation, or at a distance of more than 1000 nucleotides from the promoter 
(Maniaiis et al. (1987) Science 236:1237; Alberts et al. (1989) Molecular Biology of 
the Cell, 2nd ed.). Enhancer elements derived from viruses can be particularly useful, 
because they typically have a broader host range. Examples useful in mammalian 
cells include the SV40 early gene enhancer (Dijkema et al (1985) EMBO J. 4:761) 
and the enhancer/promoters derived from the long terminal repeat (LTR) of the Rous 
Sarcoma Vims (Gorman et al. (1982b) Proc. Natl. Acad. Sci. 79:6777), from human 
cytomegalovirus (Boshart et al. (1985) Cell 41 :521) as well as the mouse enhancer 
(Gillies, Cell 33:717-728, .1983). Additionally, some enhancers are regulatable and 
become active only in the presence of an inducer, such as a hormone or metal ion 
(Sassone-Corsi and Borelli (1986) Trends Genet. 2:215; Maniatis et al. (1987) 
Science 236:1237). 

In addition, the transcription unit can also be comprised of a termination 
sequence and a polyadenylation signal v^hich are operably linked to the GBS toxin 
receptor coding sequence. Polyadenylation signals include, but are not limited to, the 
early or late polyadenylation signals from SV40 (Kaufinan and Sharp), the 
polyadenylation signal from the Adenovirus 5 BIB region and the human growth 
hormone gene terminator (DeNoto et al., Nuc. Acids Res. 9:3719-3730, 1981). 

Sequences that cause amplification of the gene may also be desirable, as are 
sequences which encode selectable markers. Selectable markers for mammalian cells 
are known in the art, and include, for example, thymidine kinase, dihydrofolate 
reductase (together with methotrexate as a DHFR amplifier), aminoglycoside 
phosphotransferase, hygromycin B phosphotransferase, asparagine synthetase, 
adenosine deaminase, and antibiotic resistant genes such as neomycin. 

A GBS toxin receptor, or Augment thereof, can be expressed on the surface of 
a cell, or can be expressed in soluble or secreted form. Expression on the surface of 
the cell can be achieved, for example, by including a secretory leader operably linked 
to a nucleic acid sequence encoding the desired receptor Augment and at least one 
transmembrane domain. The secretory leader can be that encoded by the GBS toxin 
receptor gene, or can be a heterologous leader sequence conmionly used in the art, 
such as, for example, the leader sequence of Schizosaccharomyces pombe phol+ acid 
phosphatase (Braspenning et al., Biochem Biophys Res. Commun (1998) 245:166- 
71), the leader sequence of human interleukin-2 (IL-2) gene (Sasada et al.. Cell Struct 

34. 



wo 00/05375 PCT/US99/16676 

Funct (1988) 13: 129-141). Expression in solul?le or secreted form can be achieved, 
for example, by excluding from the gene construct nucleic acid sequences encoding a 
transmembrane domain. In some instances, solubility and/or secretion are achieved 
by the use of a fusion partner, such as, for example, chloramphenicol acetyltransferase 
5 (CAT), P-galactosidase, and other genes readily expressed in the selected host cell. 

The vector that encodes GBS toxin receptor can be used for transformation of 
a suitable mammalian host cell. Transformation can be by any known method for 
introducing polynucleotides into a host cell, including, for example packaging the 
polynucleotide in a virus and transducing a host cell with the virus or by transfection 

1 0 procedures known in the art, as exemplified by U.S. Patent Nos. 4,399,216, 
4,912,040, 4,740,461, and 4,959,455 (these patents are incorporated herein by 
reference). The transformation procedure used depends upon the host to be 
transformed. Methods for introduction of heterologous polynucleotides into 
mammalian cells are known in the art and include dextran-mediated transfection, 

1 5 calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct 
microinjection of the DNA into nuclei. 

Mammalian cell lines available as hosts for expression are known in the art 
and include many immortalized cell lines available from the American Type Culture 

20 Collection (ATCC), including but not limited to Chinese hamster ovary (CHO) cells, 
HeLa cells, baby hamster kidney (BHK) cells, monkey kidney cells (COS), NlE-115 
(Liles et ah, J. Biol. Chem. 261:5307-5313, 1986), PC 12 human hepatocellular 
carcinoma cells (e.g.. Hep G2), and a number of other cell lines, such as insect 
derived cell lines IF9 ancl IF2 1 . Cell lines of particular preference are those expressing 

25 recombinant GBS toxin receptor constructs constitutively, lines which subsequently 
develop characteristics of a transformed cell, and lines which more preferably express 
GBS toxin receptor or fragments on the cell surface. Particularly preferred are ECV 
cells (a bladder carcinoma cell line originally referred to in the scientific literature as 
an endothelial cell line), human umbilical vein endothelial cells (HUVEC), bovine, 

30 sheep, and human adrenal medulla endothelial cells: 

Recombinant GBS toxin receptor or Segments thereof can be produced by 
culturing host cells expressing the receptor or fragment in a suitable culture medium 
and under appropriate cell culture conditions. Culture media and conditions are 
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variable depending on the requirements of a particular host cell line and are well 
known in the art. Typically, cells are cultured at ZTC in a cell culture incubator with 
a fixed amount of C02, usually in the range of 5-10%. 

In another embodiment, the polypeptide fragments can be synthesized 
5 chemically by techniques well known in the art, such as solid-phase peptide synthesis 
(Stewart et al., Solid Phase Peptide Synthesis, W.H. Freeman Co., San Francisco 
(1963)); Merrifield, J Am Chem Soc 85:2149-2154 (1963)). These and other methods 
of peptide synthesis are also exemplified by U.S. Patent Nos. 3,862,925, 3,842,067, 
3,972,859, and 4,105,602. The synthesis can use manual synthesis techniques or 

10 automatically employ, for example, an Applied BioSystems 430A or 431 A Peptide 
Synthesizer (Foster City, California) following the instructions provided in the 
instruction manual supplied by the manufacturer. It will be readily appreciated by 
those having ordinary skill in the art of peptide synthesis that the intermediates which 
are constructed during the course of synthesizing the present analog compounds are 

15 themselves novel and useful compounds and are thus within the scope of the 
invention. 

In addition to polypeptides consisting only of naturally-occurring amino acids, 
peptidomimetics are also provided. Peptide analogs are commonly used in the 
pharmaceutical industry as non-peptide drugs with properties analogous to those of 

20 the template peptide. These types of non-peptide compounds are termed "peptide 
mimetics" or "peptidomimetics" (Fauchere, J. (1986) Adv. Drug Res. 15: 29; Veber 
and Freidinger (1985) TINS p.392; and Evans et al. (1987) J. Med. Chem 30: 1229, 
which are incorporated herein by reference) and are usually developed with the aid of 
computerized molecular modeling. Peptide mimetics that are structurally similar to 

25 therapeutically useful peptides may be used to produce an equivalent therapeutic or 
prophylactic effect. Generally, peptidomimetics are structurally similar to a paradigm 
polypeptide (i.e., a polypeptide that has a biochemical property or pharmacological 
activity) but have one or more peptide linkages optionally replaced by a linkage 
selected from the group consisting of: -CH2NH-, -CH2S-, -CH2-CH2-, -CH=CH- 

30 (cis and trans), -C0CH2-, -CH(0H)CH2-, and -CH2SO., by methods known in the art 
and further described in the following references: Spatola, A.F. in "Chemistry and 
Biochemistry of Amino Acids, Peptides, and Proteins," B. Weinstein, eds., Marcel 
Dekker, New York, p. 267 (1983); Vol. 1, Issue 3, "Peptide Backbone Modifications" 
(general review); Morley, J.S., Trends Pharm Sci (1980) pp. 463-468 (general 
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review); Hudson, D. et al., Int J Pept Prot Res (1979) 14:177-185 (-CH2NH-, 
CH2CH2-); Spatola, A.F. et al., Life Sci (1986) 38:1243-1249 (-CH2-S); Hann, 
M.M., J Chem Soc Perkin Trans I (1982) 307-314 (-CH-CH-, cis and trans); 
Almquist, R.G. et al, J. Med Chem (1980) 23:1392-1398 (-C0CH2-); Jennings- 
5 White, C. et al., Tetrahedron Lett (1982) 23:2533 (-C0CH2-); Szelke, M. et al., 

European Appln. EP 45665 (1982) CA: 97:39405 (1982) (-CH(0H)CH2-); Holladay, 
M.W. et al., Tetrahedron Lett (1983) 24:4401-4404 (-C(0H)CH2-); and Hruby, V.J., 
Life Sci (1982) 31:189-199 (-CH2-S-); each of which is incorporated herein by 
reference. A particularly preferred non-peptide linkage is -CH2NH-. Such peptide 

1 0 mimetics may have significant advantages over polypeptide embodiments, including, 
for example: more economical production, greater chemical stability, enhanced 
pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered 
specificity (e.g., a broad-spectrum of biological activities), reduced antigenicity, and 
others. Labeling of peptidomimetics usually involves covalent attachment of one or 

15 more labels, directly or through a spacer (e.g., an amide group), to non-interfering 
position(s) on the peptidomimetic that are predicted by quantitative structure-activity 
data and/or molecular modeling. Such non-interfering positions generally are 
positions that do not form direct contacts with GBS toxin (e.g., are not contact points 
in the GBS toxin binding domain of the GBS toxin receptor). Derivitization (e.g., 

20 labelling) of peptidomimetics should not substantially interfere with the desired 
biological or pharmacological activity of the peptidomimetic. 

Systematic substitution of one or more amino acids of a consensus sequence 
with a D-amino acid of the same type (e.g., D-lysine in place of L-lysine) may be 
used to generate more stable peptides: In addition, constrained peptides comprising a 

25 consensus sequence or a substantially identical consensus sequence variation may be 
generated by methods known in the art (Rizo and Gierasch (1992) Ann. Rev. 
Biochem. 61: 387, incorporated herein by reference); for example, by adding internal 
cysteine residues capable of forming intramolecular disulfide bridges which cyclize 
the peptide, 

30 The invention also provides a complex comprising a GBS toxin bound to a 

mammalian GBS toxin receptor or a fi-agment of a mammalian GBS toxin receptor. 
Preferably, the complex comprises a GBS toxin bound to a GBS toxin receptor 
polypeptide described above that can bind GBS toxin. Typically, a complex is 
formed by contacting a GBS toxin with such a polypeptide under conditions that 
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permit specific binding of the GBS toxin to the polypeptide. The GBS toxin can be 
labeled or unlabeled. The polypeptide can be present on the surface of a cell, or 
immobilized in a well or on a bead, or the polypeptide can be present in solution. 

Detection Methods 

Yet another aspect of the invention provides methods for detecting or 
monitoring a variety of medical conditions characterized by pathologic and/or 
hypoxia-driven angiogenesis or neovascularization. Examples include, but are not 
limited to, early onset disease in the neonate, and the progression of cancers involving 
tumors. 

Early onset disease can be diagnosed by detecting the presence or absence of 
GBS toxin in a patient. One method of detection is a competition assay that 
determines the effect of a suspected sample on the formation of a complex between 
GBS toxin and a GBS toxin receptor or fragment thereof. For example, the method 
comprises contacting a control GBS toxin with a GBS toxin receptor polypeptide, in 
the presence and absence of a sample suspected of containing GBS toxin and under 
conditions that permit specific binding of the GBS toxin to the polypeptide, and 
comparing the amount of complex fonnation achieved in the presence of the 
suspected sample to the amount of complex formation achieved in the absence of the 
suspected sample. Preferably, the control GBS toxin is substantially purified and of a 
known concentration. Preferably, the control GBS toxin further comprises a label. 
Suitable labels include, but are not limited to, radioisotopes, chromophores, 
fluorophores, biotin, avidin, and other labels used by one skilled in the art. Another 
method directly measures, rather than by competition with a control GBS toxin, 
complex formation between GBS toxin present in a suspected sample and a GBS 
toxin receptor polypeptide. 

Pathologic vasculature can be detected in a manmialian tissue by detecting the 
presence or absence of GBS toxin receptor in the region of a tumor, with the presence 
of GBS toxin receptor being indicative of the presence of pathologic vasculature. The 
method can be used to monitor tumor growth or metastasis. One method of detection 
involves the use of molecules, e.g. antibodies, that specifically bind to a GBS toxin 
receptor, preferably an extracellular domain of GBS toxin receptor. Typically, the 
method comprises administering, to a mammalian tissue, e.g. in a mammal having a 
cancerous tumor, e.g., an antibody that recognizes a GBS toxin receptor, and 
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detecting specific binding of the antibody. Typically, the antibody is a labeled 
antibody. Preferably, the observations are quantitative and can be visual. 

During surgery, the margin of a tumor can be visualized by any of a number of 
imaging techniques known in the art and described above. The imaging of the tumor 
5 is effected by detecting the binding of a labeled antibody or other molecules to the 
GBS toxin receptor on the pathologic vasculature of a tumor This type of surgery is 
also known as virtual surgery because while performing the surgery, the surgeon 
views the tumor indirectly on an imaging screen. 

10 Drug Discovery 

A fourth aspect of the invention provides methods, using the polypeptides of 
the invention, of identifying drug candidates for the treatment of medical conditions 
characterized by hypoxia-driven angiogenesis or neovascularization. Preferred 
compounds are competitive inhibitors of GBS toxin binding to a GBS toxin receptor 

1 5 or inhibit GBS toxin receptor activity. Particularly preferred are compounds that 

inhibit the first phosphorylation step in the signal transduction pathway. Compounds 
can be produced by a variety of random drug design methods commonly known in the 
art, such as, for example, combinatorial chemistry (U.S. Patent No. 5,646,285; U.S. 
Patent No. 5,639,603), peptide libraries (U.S. Patent No. 5,591,646; U.S. Patent No. 

20 5,367,053; U.S. Patent No. 5,747,334), phage display (U.S. Patent No. 5,403,484; 
U.S. Patent No. 5,223,409), SELEX® (U.S. Patent No. 5,773,598; U.S. Patent No. 
5,763,595; U.S. Patent No. 5,763,566), and combinatorial carbohydrate chemistry 
(Hirschmann et ah, J Med Cham (1996) 39:2441-2448; Hirschmann et al., J Med 
Chem (1998)41:1382-1391; Sofia MJ/MotDivers (19^^^ 3:75-94; U.S. Patent No. 

25 5,780,603; U.S. Patent No. 5,756,712) 

An alternative z^proach is rational drug design with the intent of producing a 
GBS toxin mimetic or a GBS toxin receptor mimetic with improved therapeutic 
properties using techniques such as x-ray crystallography, nuclear magnetic resonance 
(NMR) correlation spectra (U.S. Patent No. 5,698,401), computer assisted molecular 

30 modeling (U.S. Patent No. 5,579, 250; U.S. Patent No. 5,612,895; U.S. Patent No. 
5,680,331, Cooper et al., J. Comput.-Aided Mol. Design, 3:253-259 (1989); Brent et 
al., J. Comput.-Aided Mol Design 2:3 1 1 -3 1 0 ( 1 988)) and other methods of rational 
drug design known in the art. FIG. 1 provides a broad overview of some of the main 
steps in some of the rational drug design methods of the present invention. For 
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example, one approach to rational drug design involves a computer program, such as 
INSIGHTII (available from Bisoym Technologies, 10065 Barnes Canyon Road, San 
Diego, California) to identify active sites in proteins by homology-based modeling. 
This method facilitates the modeling of a protein by using a similar protein whose 
5 structure is well known. Commercial software containing search algorithms for three 
dimensional database comparisons are available from vendors such as Day Light 
Information Systems, Inc., Irvine, California 92714, and Molecular Design Limited, 
2132 Faralton Drive, San Leandro, California 94577. 

In one embodiment, the compound can bind the GBS toxin receptor and 
1 0 induce an inflammatory response in a manner similar to the binding of GBS toxin to 
the GBS toxin receptor. Such compounds can be used, for example, as a drug to 
target an inflammatory response to the developing vasculature of a tumor. 

In another embodiment, the compound can bind the GBS toxin receptor with 
or without inducing an inflammatory response, preferably without inducing an 
1 5 inflammatory response. In one instance, the compound can be used as a vehicle to 
target pathological neovasculature for treatment with a cytotoxic agent. For example, 
the cytotoxic agent can be chemically coupled to the compound to form a chimeric 
drug. Such chimeric drugs can be used in the treatment of tumors, rheumatoid 
arthritis, wound healing, spinal cord injury, and other conditions characterized by 
20 hypoxia-driven angiogenesis or neovascularization. In another instance, the 

compound can be used directly to competitively inhibit binding of GBS toxin to a 
GBS toxin receptor. Such compounds can be used in the treatment of early-onset 
disease in the neonate. 

In a third embodiment, the compound can bind GBS toxin and can be used in 
25 the treatment of early-onset disease in the neonate. 

The polynucleotides of the invention can be expressed in random mutagenesis 
systems such as phage display or the yeast two-hybrid system for the synthesis and 
identification of mutant peptide GBS toxin receptor polypeptides that bind GBS toxin. 
Ahematively, immobilized or soluble GBS toxin receptor fragments of the invention 
30 can be used to screen combinatorial peptide and combinatorial chemical libraries and 
non-random recombinant and synthetic peptides and other compounds (such as non- 
peptide molecules) for GBS toxin receptor binding. Compounds that bind GBS toxin 
or GBS toxin receptor can then be further characterized in a functional assay for any 
of the activities described above in order to identify a drug candidate for the treatment 
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of medical conditions involving angiogenesis or neovascularization. 

A compound which inhibits binding of GBS toxin to a GBS toxin receptor can 
be identified by combining a test compound with a mammalian GBS toxin receptor or 
fragment thereof capable of binding GBS toxin, under conditions that permit specific 
5 binding of GBS toxin to the GBS toxin receptor or fragment, and determining the 
amount of inhibition by the compound of the binding of GBS toxin to the GBS toxin 
receptor or fragment. 

In a preferred embodiment, the GBS toxin receptor or fragment is expressed 
by a cell, preferably on the cell surface. The cells are contacted with labeled GBS 

10 toxin in the presence or absence of the test compound. A change in the binding of 
GBS toxin to the GBS toxin receptor is then determined. Altematively, the GBS 
toxin is unlabeled and an antibody that recognizes GBS toxin is labeled instead. The 
labeled antibody is used to measure inhibition by a compound of GBS toxin binding 
to the GBS toxin receptor or fragment. In another embodiment, the GBS toxin 

15 receptor or fragment is not associated with a cell, but is instead coupled to a matrix, 
such as, for example, a well in a microtiter plate or a bead. Additional suitable solid 
supports include latex, polystyrene beads (Interfacial Dynamics Corp. Portland, 
Oreg.), magnetic particles (Advanced Magnetics, Cambridge, Mass.) and nylon balls 
(Hendry et al, J. Immunological MetK 35:285-296, 1980). The receptor or fragment 

20 can be coupled to the matrix directly or indirectly through an antibody, coupled to the 
matrix, that binds the receptor fragment. In a third embodiment, the GBS toxin 
receptor or fragment is soluble and can be immunoprecipitated with an antibody that 
recognizes the receptor or fragment. 

' A preferred method for identifying a compound which binds a mammalian 

25 GBS toxin receptor comprises the steps of (1) combining a test compound with a GBS 
toxin receptor or fragment thereof under conditions that allow specific binding to 
occur, and (2) detecting a complex formed between the test compound and the GBS 
toxin receptor or fragment, A preferred method is a competition assay which 
determines the ability of the test compound to compete for binding to the GBS toxin 
30 receptor or fragment. In such an assay, GBS toxin is combined with the GBS toxin 
receptor or fragment in the presence or absence of the test compound. Decreased 
specific binding of GBS toxin in the presence versus the absence of the test compound 
is indicative of the ability of the test compound to bind a mammalian GBS toxin 

receptor. Another method comprises combining a control compound with the GBS 
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toxin receptor or fragment under the same conditions as the test compound and 
comparing the amount of complex formation between the test compound or the 
control compound and the GBS toxin receptor or fragment thereof Preferably, the 
test compound and/or the control compound are labeled. The test compound can be 
any of a number of classes of compounds, such as for example, small organic 
molecules (such as those used for and obtained by combinatorial chemistry), 
polysaccharides, polypeptides, RNA, antibodies, and single chain antibodies. In a 
preferred embodiment, the polypeptide is expressed by a cell, preferably on the 
surface of the cell, and preferably by a stable transfected cell. Such a system is 
particularly useful for testing the effectiveness of a chimeric compound comprising a 
cytotoxic agent. The cytotoxic activity of the compound can be determined by 
exposing a cell expressing the GBS toxin receptor on the cell surface to the test 
chimeric compound and detecting signs of cytotoxicity. One could detect such signs 
by a viability stain of the ceil, by detecting apoptosis (for example, by a DNA ladder 
assay or a TUNEL™ stain, which binds to broken DNA), by measuring tritiated 
thymidine incorporation into the cell, and by quantitating kinase-dependent 
phosphorylation (e.g., using phosphoantibodies or various phosphoimaging 
techniques). 

In another embodiment, the invention provides a method for identifying an 
inhibitor of GBS toxin receptor. The method comprises incubating test cells in the 
presence and absence of a test compound. The test cells express GBS toxin receptor 
or a fragment thereof having GBS toxin receptor activity (e.g., a fragment that 
increases the proliferation or migration of the expressing cells relative to confrol cells 
of the same cell type tiiat do not express the fragment). The test cells are incubated 
under conditions in which the cells incubated in the absence of the test compound can 
proliferate or migrate. Control cells that do not express the GBS toxin receptor or 
fragment proliferate or migrate less than cells that express the GBS toxin receptor or 
fragment. The proliferation or migration (also referred to herein as motility) of the 
test cells incubated in the presence or absence of the test compound is compared. Less 
proliferation or migration in the presence of the test compound than in the absence of 
the test compound is indicative of the test compound being an inhibitor of the GBS 
toxin receptor. Preferably, as a control to determine whether the test compound 
specifically inhibits the GBS toxin receptor, the proliferation or migration of control 
cells in the presence and absence of the test compound is also compared. In the 
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absence of a difference in the proliferation or migration of control cells incubated in 
the presence or absence of the test compound, decreased proliferation or migration in 
test cells exposed to the test compound relative to test cells not exposed to the test 
compound is indicative of specific inhibition of the GBS toxin receptor. It will be 
5 readily apparent that the control portions of the method need not be performed 

contemporaneously with the test portions of the method. For example, control cells 
can be incubated with a battery of test compounds to determine cellular effects of the 
test compounds prior to incubating the test cells with the test compounds. Motility or 
migration can be determined by detecting movement of cells on a culture dish. 

10 Proliferation can be detected in a number of ways, including, but not limited to, 
measuring tritiated thymidine incorporation, cell counts, apoptosis assays, and 
viability assays. Prefenred cells include cells transfected with GBS toxin receptor, 
preferably endothelial cells transfected with GBS toxin receptor, even more 
preferably vascular endothelial cells or microvascular endothelial cells. Primary cells 

15 that express GBS toxin receptor are also preferred, for example, endothelial cells that 
have been passaged in cell culture, at confluence, no more than 8 or 9 times. A 
preferred class of test compounds includes kinase inhibitors, preferably cAMP- 
dependent kinase inhibitors, PKC inhibitors, and CK2 inhibitors, which can be used 
as a starting point for developing more specific GBS toxin receptor inhibitors. 

20 Another class of compounds includes antibodies specific for GBS toxin receptor. 
Particularly preferred are single chain antibodies, preferably a collection of single 
chain antibodies that recognize various epitopes on the GBS toxin receptor. Less 
preferred are divalent antibodies specific for the binding site of the GBS toxin 

receptor ligand because they may trigger the signal transduction cascade upon 
25 dimerization. 

Another embodiment of the invention is a method of identifying an inhibitor 
of endothelial cell proliferation or migration, which are essential components of 
angiogenesis. The method basically comprises the steps described in the preceding 
paragraph and uses endothelial cells. 
30 Yet another embodiment of the invention is a method of identifying a 

therapeutic compound for the treatment or prevention of a medical condition 
characterized by pathologic or hypoxia-driven angiogenesis or neovascularization. 
The method basically comprises the steps described above and uses cells from tissues 
derived from manmials afflicted with the medical condition or cells that serve as a 
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model for afflicted tissue. 

A preferred method for designing a compound which inhibits binding of a 
GBS toxin to a mammalian GBS toxin receptor comprises (1) simulating and 
selecting the most probable conformations of a GBS toxin receptor or fragment 
5 thereof, (2) designing a chemically modified analog that substantially mimics the 
energetically most probable three-dimensional structure of the GBS toxin receptor or 
fragment, (3) chemically synthesizing the analog, and (4) evaluating the bioactivity of 
the analog. Preferably, steps (a) and (b) are performed with the aid of a computer 
program. 

1 0 A preferred method for designing a compound which binds to a mammalian 

GBS toxin receptor comprises (1) simulating and selecting the most probable 
conformations of a GBS toxin receptor or fragment thereof, (2) deducing most 
probable binding domains of the receptor or fragment, (3) designing a compound that 
would form the energetically most probable complexes with the receptor or fragment, 

15 (4) chemically synthesizing the compound, and (5) evaluating the bioactivity of the 
compound. Preferably, steps (a)-(c) are performed with the aid of a computer 
program. 

Preferred polypeptides for use in the screening assays described above are 
polypeptides sharing at least about 85% identity, preferably at least about 95% 
20 identity, and most preferably greater than about 99% identity with the amino acid 

sequence of SEQ ID NO: 2 or SEQ ID NO: 4, or a fragment thereof having GBS toxin 
receptor activity. Most preferred are polypeptides having an amino acid sequence of 
SED ID NO: 2, 4 OR 8 or a fragment thereof having GBS toxin receptor activity. 

25 Methods Of PuRincATiON 

Another aspect of the invention is a method for purifying a compound that 
binds a GBS toxin receptor, for example, natural ligand, other polysaccharides, or an 
antibody specific for the GBS toxin receptor. The method comprises providing a 
polypeptide comprising a mammalian GBS toxin receptor or fragment thereof that 

30 binds GBS toxin, contacting the polypeptide with a sample comprising the compound 
under conditions that allow specific binding of the compound to the polypeptide, and 
separating the bound compound from the remainder of the sample. The polypeptide 
can be soluble but preferably is immobilized on a substrate e.g., on a bead, membrane 
or on the surface of a cell, preferably a stable transfected cell. 
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Methods of Treatment 

GBS toxin receptor polypeptides and antibodies that interfere with GBS toxin 
binding can be used in a method of treatment of the human or animal body. For 
example, such inhibitors of GBS toxin binding can be administered to a patient to 
5 treat or prevent medical conditions involving GBS toxin binding to a GBS toxin 
receptor, such as, for example, early onset disease in the neonate. 
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GBS toxin mimetics or other compounds thai bind and/or inhibit GBS toxin 
receptor, some of which can be identified by the drug discovery assays of the 
invention, can be used in a method of treatment of the human or animal body or can 
be- used for the manufacture of a medicament for the treatment or prevention of any 
of a number of medical conditions involving pathologic and/or hypoxia-driven 
angiogenesis, such as, for example, cancerous tumors, chronic inflammatory diseases, 
scarring during wound healing or repair of neural injury. 

In a preferred embodiment, such a compound exerts its therapeutic effect by 
binding GBS toxin receptor and evoking an inflammatory response, as does GBS 
toxin. Preferably, such compounds comprise a sulfhydryl, hydroxyl, or amino group 
displayed so as to be available for binding complement C3. 

In another preferred embodiment, the compound is an inhibitor of GBS toxin 
activity. Preferred inhibitors include, but are not limited to, kinase inhibitors, single 
chain antibodies specific for the GBS toxin receptor, and antisense polynucleotides 
that specifically hybridize under high stringency conditions to a GBS toxin receptor 
nucleic acid sequence, such as that of SEQ ID N0:1, SEQ ID N0:3 or SEQ ID N0:7. 

In another preferred embodiment, the compound exerts its therapeutic effect 
without evoking an inflammatory response. The compound can be used to deliver a 
cytotoxic agent to tissue in close proximity to a cell expressing a GBS toxin receptor, 
such as, for example, a tumor undergoing angiogenesis. Preferably, the compound is 
covalently attached to a cytotoxic agent and can be associated non-covalently with a 
cytotoxic agent, such as, for example, on the external surface of a liposome, micelle, 
or other lipophilic drug encapsulating structure. Preferred cytotoxic agents include 
antineoplastic agents conmionly known in the art, such as, for example, 
mechlorethamine, chlorambucil, cyclophosphamide, melphalan, ifosfamide, and other 
alkylating agents, methotrexate and other folate antagonists, 6-mercaptopurine and 
other purine antagonists, 5-fluorouracil and other pyrimidine antagonists, cytarabine, 
©vinblastine, vincustine, and other vincas, etoposide and other podophyllotoxins, 
doxorubicin, bleomycin, mitomycin, and other antibiotics, carmustine, lomustine and 
other nitrosureas, cisplatin, interferon, asparaginase, tamoxifen, flutamide, and taxol. 
Other preferred biologic agents include sense and/or antisense RNA or DNA 
sequences derived firom specific tumor promoter or suppressor genes, such as, for 
example, the p53 and TGF gene families, signal transduction protein family members 
such as, for example, ras and myc, and growth factor receptor kinases such as, for 
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example flt2 and flkl. Tail, Tai2, and neuropholin, and other genes implicated in 
neoplastic disease and other diseases driven by pathologic angiogenesis. 

In another embodiment, GBS toxin receptor polypeptide or fragment thereof 
can be administered to a subject as a decoy to reduce the amount of stimulation of the 
5 GBS toxin receptor present in afflicted tissues (e.g., tumor tissues), thereby reducing 
cellular responses leading to proliferation and migration of cells of the afflicted 
tissues. Preferably, the GBS toxin receptor polypeptide or fragment is administered in 
soluble form, even more preferably sans transmembrane domains. 

1 0 Pharmaceutical Compositions 

Polypeptides of the invention that comprise a domain essential for GBS toxin 
binding that have the desired characteristics for bioavailability, stability and other 
important parameters of pharmacokinetics in vivo can be used as a competitive 
inhibitor of GBS toxin binding for medical conditions, such as, for example, early 

1 5 onset disease in the neonate, in which GBS toxin binding is undesirable. Appropriate 
polypeptides can include fragments having an amino acid sequence corresponding to 
a partial or full sequence of SEQ ID NO: 2 or SEQ ID NO: 4 or analogs thereof 

Compounds determined by assays using the polypeptides of the invention to 
bind and/or GBS toxin receptor and/or induce an inflammatory response, and that 

20 have the desired pharmacokinetic characteristics, can be used as treatments for 
medical conditions in v^hich GBS toxin binding can be therapeutic, such as, for 
example, medical conditions involving pathologic or hypoxia-driven angiogenesis or 
nep vascularization. 

Phannaceutical compositions of the invention include a pharmaceutically 

25 acceptable earner that may contain a variety of components that provide a variety of 
functions, including regulation of drug concentration, regulation of solubility, 
chemical stabilization, regulation of viscosity, absorption enhancement, regulation of 
pH, and the Hke. For example, in water soluble formulations the pharmaceutical 
composition preferably includes a buffer such as a phosphate buffer, or other organic 

30 acid salt, preferably at a pH of between about 7 and 8. Other components may 
include antioxidants, such as ascorbic acid, hydrophilic polymers, such as, 
monosaccharides, disaccharides, and other carbohydrates including cellulose or its 
derivatives, dextrins, chelating agents, such as EDTA, and like components well 
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known to those in the pharmaceutical sciences, e.g. Remington's Pharmaceutical 
Science, latest edition (Mack Publishing Company, Easton, PA). 

An effective amount of an active compound such as a GBS toxin receptor 
polypeptide, mimetic or analog, or GBS toxin mimetic or analog for particular 
applications depends on several factors, including the chemical nature of the 
polypeptide, mimetic or analog, the disorder being treated, the method of 
administration, and the like. Preferably, an effective amount will provide a 
concentration of polypeptide or mimetic of between about 0.0001 to 100 /m/ at the 
target GBS toxin receptor on a cell surface, more preferably less than 10 ^m, with less 
than 1 iM being most preferred. 

The active compound can be administered to a mammalian host in a variety of 
forms, i.e., they may be combined with various pharmaceutically acceptable inert 
carriers in the form of tablets, capsules, lozenges, troches, hard candies, powders, 
sprays, elixirs, syrups, injectable or eye drop solutions, and the like depending on the 
chosen route of administration, e.g., orally or parenterally. Parenteral administration 
in this respect includes administration by the following routes: intravenous, 
intramuscular, subcutaneous, intraocular, intrasynovial, transepithelial (including 
transdermal, ophthalmic, sublingual and buccal), topical (including ophthalmic, 
dermal, ocular, rectal, nasal inhalation via insufflation and aerosol), and rectal 
systemic. 

The active compound may be orally administered, for example, with an inert 
diluent or with an assimilable edible carrier, it may be enclosed in hard or soft shell 
gelatin capsules, compressed into tablets, or incorporated directly with the food of the 
diet. For oral therapeutic administration, the active compound may be incorporated 
with excipient and used in the form of ingestible tablets, buccal tablets, troches, 
capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions and 
preparations should contain at lease 0.1% of active compound. The percentage of the 
compositions and preparations may, of course, be varied and may conveniently be 
between about 2% to about 6% of the weight of the unit. The amount of active 
compound in such therapeutically useful compositions is such that a suitable dosage 
will be obtained. Preferred compositions or preparations according to the present 
invention are prepared so that an oral dosage unit form contains between about 1 and 
1 000 mg of active compound. 
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Tablets, troches, pills, capsules and the like may also contain the following: a 
binder such as polyvinylpyrrolidone, gum tragacanth, acacia, sucrose, com starch or 
gelatin; an excipient such as calcium phosphate, sodium citrate and calcium 
carbonate; a disintegrating agent such as com starch, potato starch, tapioca starch, 
5 certain complex silicates, alginic acid and the like; a lubricant such as sodium lauryl 
sulfate, talc and magnesium stearate; a sweetening agent such as sucrose, lactose or 
saccharin; or a flavoring agent such as peppermint, oil of wintergreen or cherry 
flavoring. Solid compositions of a similar type are also employed as fillers in soft and 
hard-filled gelatin capsules; preferred materials in this connection also include lactose 

1 0 or milk sugar as well as high molecular weight polyethylene glycols. When the 
dosage unit form is a capsule, it may contain, in addition to materials of the above 
type, a liquid carrier. Various other materials may be present as coatings or to 
otherwise modify the physical fomi of the dosage unit. For instance, tablets, pills, or 
capsules may be coated with shellac, sugar or both. A syrup or elixir may contain the 

15 active compound, sucrose as a sweetening agent, methyl and propylparabens as 
preservatives, a dye, flavoring such as cherry or orange flavor, emulsifying agents 
and/or suspending agents, as well as such diluents as water, ethanol, propylene glycol, 
glycerin and various combinations thereof. Of course, any material used in preparing 
any dosage unit form should be pharmaceutically pure and substantially non-toxic in 

20 the amounts employed. In addition, the active compound may be incorporated into 
sustained-release preparations and formulations. 

The active compound may also be administered parenterally or 
intraperitoneally. For purposes of parenteral administration, solutions in sesame or 
peanut oil or in aqueous propylene glycol can be employed, as well'as sterile aqueous 

25 solutions of the corresponding water-soluble, alkali metal or alkaline-earth metal salts 
previously enumerated. Such aqueous solutions should be suitably buffered, if 
necessary, and the liquid diluent first rendered isotonic with sufficient saline or 
glucose. Solutions of the active compound as a free base or a pharmacologically 
acceptable salt can be prepared in water suitably mixed with a surfactant such as 

30 hydroxypropylcellulose. A dispersion can also be prepared in glycerol, liquid 

polyethylene glycols and mixtures thereof and in oils. Under ordinary conditions of 

storage and use, these preparations contain a preservative to prevent the growth of 

microorganisms. These particular aqueous solutions are especially suitable for 

intravenous, intramuscular, subcutaneous and intraperitoneal injection purposes. In 
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this connection, the sterile aqueous media employed are all readily obtainable by 
standard techniques well-known to those skilled in the art. 

The pharmaceutical forms suitable for injectable use include sterile aqueous 
solutions or dispersions and sterile powders for the extemporaneous preparation of 
sterile injectable solutions or dispersions. In all cases the form must be sterile and 
must be fluid to the extent that easy syringability exists. It must be stable under the 
conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. The carrier can 
be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for 
example, glycerol, propylene glycol, liquid polyethylene glycol and the like), suitable 
mixtures thereof, and vegetable oils. The proper fluidity can be maintained, for 
example, by the use of a coating such as lecithin, by the maintenance of the required 
particle size in the case of a dispersion and by the use of surfactants. The prevention 
of the action of microorganisms can be brought about by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, 
thimerosal and the like. In many cases it will be preferable to include isotonic agents, 
for example, sugars or sodium chloride. Prolonged absorption of the injectable 
compositions can be brought about by use of agents delaying absorption, for example, 
aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the active compound 
in the required amount in the appropriate solvent with various of the other ingredients 
enumerated above, as required, followed by filtered sterilization. Generally, 
dispersions are prepared by incorporating the sterilized active ingredient into a sterile 
vehicle which contains the basic dispersion medium and the required other ingredients 
firom those enumerated above. In the case of sterile powders for the preparation of 
sterile injectable solutions, the preferred methods of preparation are vacuum drying 
and the freeze drying technique which yield a powder of the active ingredient plus any 
additional desired ingredient from the previously sterile-filtered solution thereof 

For purposes of topical administration, dilute sterile, aqueous solutions 
(usually in about 0.1% to 5% concentration), otherwise similar to the above parenteral 
solutions, are prepared in containers suitable for drop-wise administration to the eye. 
The compounds of this invention may be administered to a mammal alone or in 
combination with pharmaceutically acceptable carriers. As noted above, the relative 
proportions of active ingredient and carrier are determined by the solubility and 
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chemical nature of the compound, chosen route of administration, the particular 
compound chosen and the physiological characteristics of the particular patient under 
treatment. 

5 Kits 

Yet another aspect of the invention is a kit for use in carrying out any of the 
above methods. A preferred embodiment is a kit comprising a GBS toxin receptor or 
fragment thereof. Preferably, the receptor or fragment is immobilized. A preferred 
kit can be used for identifying a compound that binds to GBS toxin receptor, and 

10 comprises at least one cell that expresses GBS toxin receptor. 

Another embodiment is a kit for monitoring tumor growth or metastasis, 
comprising a reagent for detecting expression of a GBS toxin receptor. Examples of 
such reagents include, but are not limited to, polynucleotide probes that hybridize to a 
GBS toxin receptor nucleic acid sequence and compounds that bind to a GBS toxin 

15 receptor, such as, for example, an antibody that specifically recognizes GBS toxin 
receptor, a GBS toxin, a GBS toxin mimetic, or other compounds identified by the 
screening methods described above. 
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A third embodiment is a kit for purifying a compound that binds a GBS toxin 
receptor, comprising a GBS toxin receptor or fragment thereof that binds the 
compound. Preferred compounds include GBS toxin, GBS toxin mimetics, antibodies 
that specifically bind GBS toxin receptor, and other compounds identified by the 
5 screening methods described above. 

Additional kit components can include, but are not limited to, additional 
reagents required for detection, a reference standard(s), instructions for use, and the 
like. Suitable reference standards include positive controls, negative controls, 
photographs of such controls, tabulated or graphed data of such controls, and the like. 
1 0 The kits may further comprise instructions for carrying out the methods described 
above, preferably printed instructions. 



EXAMPLE?) 

Example 1 • Cloning Sheep GBS Toxin Receptor 

15 Primary culture of sheep lung endothelial cells 

Small pieces of primary lung tissues from a 7-week old sheep are cut 
into small pieces in Hank's balanced salt solution (HBSS) containing 10 mM HEPES 
buffer (Life Technology), 1% penicillin/streptomycin and 0.1% gentamycin, and are 
cultured in sheep limg complete medium (Life Technology) at 37°C. After one week 

20 of the culture, clones of sheep lung endothelial cells are identified by Cobblestone 
morphology and harvested into 24-well tissue culture plates (Falcon) using cloning 
rings. When the cells are confluent, they are detached by pancreatin and transferred 
to a 60-mm tissue culture Petri dish or a T-25 tissue culture flask (Falcon). When 
they are confluent again, they are split and cultured into a few lOO-mm tissue culture 

25 plates (Falcon). Each split is considered to be one passage. The same procedure is 
repeated until enough cells (-10^) are obtained for isolation of mRNA, 
Isolation of mRNA and construction of cDNA lihrayy 
Poly(A)+ RNA is isolated from 9.2 x 10^ sheep lung endothelial cells 
(passage 8 and 9) by a standard method (Pharmacia). A total of 16 ^ig poly(A)'' RNA 

30 is acceptable amount obtained. 2.5 (ig mRNA can be used to construct a cDNA 

library. Poly(A)'' RNA is oligo(dT)-primed (with Not I restriction site) and converted 
into double-stranded cDNA. After adding a BstXVEcoR I adaptor, the cDNA is 
unidirectionally cloned into the BstXl and Not I sites of pCDNA3. 1(+) (Invitrogen). 
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E. coli Topi OF' (Invitrogen)'is used as a host strain for amplification. 5.38 x 10^ 
primary clones are an acceptable number generated. The library is amplified by 
plating cells onto fifty large LB agar plates containing ampicillin (100 |ig/ml). The 
plates are scraped and aliquoted so that each aliquot represents 10 plates. DNA is 
5 purified by Qiagen Max columns (Qiagen). 

Screening of cDNA library for a gene encoding GV^R fpyin refi-.f^p rnr 
To screen a cDNA library for a gene encoding GBS toxin receptor 
gene, a unique colorimetric method is used. Five |ig plasmid DNA fi-om each pool of 
cDNA library is used to transfect C0S7 cells. The transfected cells are cultured in 

10 four to eight 96-well tissue culture plates (Falcon) for transient expression. Each well 
contains about 20,000 transfected cells in DMEM medium (Life Technology). C0S7 
cells transfected with pCDNA3.1(+) are used as a control. After 3 days expression, 
the medium is carefiilly removed. Each well is rinsed 3 times with HPSS buffer 
containing Mg^"" and Ca^"" (wash buffer) (Life Technology). 

15 The cells are then incubated with biotinylated toxin (50/^/ per well; 1 to 

1 .5 /ig/ml) at room temperature for 1 h. After the hour incubation, the biotinylated 
toxin is discarded and the wells are rinsed 3 times with the wash buffer. The cells are 
incubated with streptavidin-B-gal solution and each well is rinsed 3 times with the 
wash buffer. The cells are then incubated with PNPG (50 \x\ per well; 1 mg/ml in 

20 substrate buffer) at 3TC, Absorbance at 405 nm is measured by an ELISA reader at 
1 and 20 h, respectively. The cells which give the highest OD are harvested. Plasmid 
DNA is isolated by Hirt extraction. Plasmid DNA is amplified in £. coli to have 
enough DNA for the next transfection (enrichment). 

Enrichment is done 8 times by this colorimetric method. The number 

25 of the transfected cells loaded into each well is gradually decreased in the last few 
enrichments and untransfected cells are added to each well to give a total number of 
20,000 cells per well for the cells to be confluent and to reduce background after 3 
days* expression. At the last enrichment, each well has only 1 to 10 transfected cells. 
Cells giving the highest OD are harvested. DNA is isolated and amplified in E, coli. 

30 A number of isolated clones are individually assayed by this 

colorimetric method. The clones which showed higher binding to CM 101 are 
sequenced. 

Sequence analysis 
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DNA sequence analysis of clone pFU102, which has a 2.1kb insert, 
revealed a sequence encoding a partial integral glycoprotein. N-terminal sequence 
was obtained by 5 'RACE method (Life Technology) and a full-length ^ene is 
designated as SP55. Triple ligation yielded pCD55, which contains an entire coding 
5 region of SP55. 

mRNA for the SP55 has 2844 nucleotides, encoding a protein of 495 amino 
acids with a predicated mass of 55 KDa, SP55. Analysis by the method of Klein et al. 
(Klein et al, Biochim Biophys Acta, 815:468-476 (1985)) classifies SP55 as an 
integral protein with seven transmembrane segments. SP55 has both N-glycosylation 
10 and kinase phosphorylation sites. A Swiss-Prot. search of SP55 did not reveal any 
high homology to known human proteins. However, SP55 has some identity 30%) 
to renal sodium-dependent phosphate transporters from human, rabbit, mouse and rat. 
In addition, SP55 has some identity (- 30 to 39%) to hypothetical proteins (HYP50 
and HYP63) from C elegans. 

15 

Example 2 - Cloning Human GBS Toxin Receptor 

The sheep GBS toxin receptor sequence shares about 37% identity 
with HYP50 and about 33% identity to HYP63, two hypothetical proteins from C 
elegans. In the regions corresponding to amino acid residues 180-186 and 443-449 of 

20 SEQ ID No. 2, five amino acids within a seven amino acid stretch are absolutely 
conserved among the three proteins. 

A first degenerate oligonucleotide, CMR3-S: 5*- 
CGGGATCCCGCCNGCNATGCAYRSHRTSTGG-3XSEQ ID No. 5), was designed 
to include all possible codons encoding the aniino acid sequences of SP55, HYP50, 

25 and HYP63 in the 180-186 region. A second degenerate oligonucleotide, CMR4- 
AS2: 5'-GGAATTCCDGGDGCRATKTCNARRTRRTT-3' (SEQ ID No. 6), was 
designed to include the complementary sequences of all possible codons encoding the 
amino acid sequences of SP55, HYP50, and HYP63 in the 443-449 region. 

Polymerase chain reaction (PGR) was conducted using these 

30 oligonucleotides and a human embryo lung cDNA library as a template. The reaction 
yielded three overlapping sequences approximately 400 bp in size, which encompass 
part of the nucleic acid sequence of SEQ ID No. 3. These sequences were then used 
as probes to clone the remainder of the gene, referred to herein and HP59 (SEQ ID 
NO: 7). 
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10 



Example 3 - Preparation of Antibodies against GBS Toxin Receptor 

Rabbits are immunized with the synthetic peptides shown in Table 8. 
A-1 mg/ml solution of peptide plus KLH in 0.0 IM phosphate buffer is prepared. For 
the first immunization, 200 |ig of peptide plus KLH (200 |il) and an equal volume of 
Freunds complete adjuvant, emulsified well before injection, is injected into 3-4 spots 
on the dorsal surface about the neck and shoulders of a rabbit. After two weeks, the 
second immunization (boost) is given at the same concentration of immunogen, but 
emulsified in Freunds incomplete adjuvant. The boost is delivered in the same region 
of the body. After another two weeks, blood is collected and assayed by ELISA for 
response against the peptide without KLH. Further boosts are given to improve 
antibody titer, if necessary. 



15 



Table 8 
Immunogenic Peptides 



Peptide Amino Acid Sequence 

p56a APSDGEEGSDRTPLLQRAPRAEPAPVC 27 aa 



SEP ID m. 



p55a 
p57a 



LAPSDGEEGSDRTPL 



NTTAKDNRTSYECA 



20 



residues 8-35 of 
SEQ ID N0:4 

15aa residues 7-22 of 
SEQ ID NO: 4 

14aa residues 71-84 of 
SEQ ID NO: 4 

Peptide p55 is a fragment of an extracellular domain of GBS toxin receptor. 
Peptide p57a is a fragment of an intracellular domain of GBS toxin receptor. Animals 
immunized with these peptides produce polyclonal antibodies Pab55 and Pab57, 
respectively. 



25 



Example 4 - Detection of GBS Toxin Receptor Expression in Tumor 
Cells 

This example shows that GBS toxin receptor can be detected in tumor cells. 
Immunohistochemistry is performed on paired human and mouse tissues of normal or 
tumor origin, using rabbit polyclonal antibodies Pab 55 and Pab 57, 
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Mouse and human tumor tissues are fixed in 10% neutral formalin. The 
tissues are then dehydrated, paraffin embedded and 1 0-20 x 8-micron sections are cut 
for immunohistochemical staining. 

Immunohistochemical analysis is performed with the automated Ventana 
5 Immunohistochemical Stainer according to the manufacturer's suggested protocol 
(Ventana. Tucson, Arizona). Sections are deparaffmated with xylene. The prepared 
sections are then treated with 1% hydrogen peroxide prepared in 30% aqueous 
methanol for 20 minutes at room temperature to quench endogenous peroxidase 
activity. The slides are then washed with PBS, blocked with 5% BSA and 5%, goat 

10 serum in PBS, washed again and then incubated for 30 minutes at 37°C with the 

appropriate diluted (1 : 100) antibody. Horseradish peroxidase-labeled goat anti-rabbit 
IgG is used as a secondary antibody. For visualization, the sections are incubated 
with DAB/H2O2. The sections are finally incubated with a copper enhancer (Ventana) 
for 4 minutes, washed, counterstained with hematoxylin, and mounted in toluene- 

15 minus mounting medium. Photographic documentation is performed and images are 
stored for later review and analysis. The results are summarized in Table 9. The 
numbers refer to glass slides. 

TABLE 9 

20 Immunohistochemistry of tumor and normal tissues 

(diff, = differentiated) 

Human tissues: 





Antibody 


Magnification 


Signal 


1 . Ovary tumor ( 95-02 VOI 6) high grade 


Pab55 


400x 


_ + 


papillary carcinoma 






2. Ovary tumor ( 95-02VO16) high grade 


Pab55 


400x 


+ 


papillary carcinoma 








3. Normal ovary ( 96-08ZO08) control 


Pab 55 


400x 




tissue 








4. Ovary tumor ( 95-02VO16) high grade 


Pab57 


400x 




papillary carcinoma 








5. Ovary tumor ( 9 5 -02 VO 16) high 


Pab 57 


400x 




grade papillary carcinoma 






6. Ovary tumor ( 95-02VO16) high grade 


Pab 57 


400x 


+ 


papillary carcinoma 








7. Normal ovary 96-08ZO08) control 


Pab 57 


400x 




8. Colon cancer 95-14664) poorly diff. 


Pab 55 


400x 


+ 


Adenocarcinoma 








9. Normal colon 9708VO08) control 


Pab 55 


400x 




10. Colon cancer 95-14664) poorly diff. 


Pab 57 


400x 


+ 


Adenocarcinoma 
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1 1. Colon cancer 95-14664) poorly diff. 


Pab57 


400x 


J 


Adenocarcinoma 








12. Normal colon 9708 VO08) control 


Pab57 


400x 




13. Female breast cancer ( 97-IOV03a) 


Pab 55 


400x 


+ 


invasive mammary carcinoma 








14. Male" breast cancer (no code) 


Pab55 


400x 




mammary carcinoma 








15. Normal female breast 97-12VO20-3) 


Pab 55 


400x 




control 








16. Female breast cancer 97-10 V03a) 


Pab 57 


400x 


+ 


Invasive mammary carcinoma 








17 Male hrea*;t cancer (t\c\ coHe^ 


Pab 57 




+ 


mammary carcinoma 








18. Normal female breast (97-12VO20-3) 


Pab 57 


400x 




control 








19. Lung cancer ( 97- 1 OV022-5) poorly 


Pab 55 


400x 


4- 


diff. NOJ-small cell carcinoma 








20. Normal lung ( 98-0 1 VO 11) control 


Pab 55 






21. Lung cancer ( 97-10VO22-5) poorly 


Pab 57 


400x 




diff. NOJ-small cell carcinoma 








22. Lung cancer ( 97-10VO22-5) poorly 


Pab 57 


400x 




diff. NOJ-small cell carcinoma 








23. Normal lung ( 98-0 1 VO 11) control 


Pab 57 







Mouse Tissues: 



Antibody Magnification Signal 



24. Madison Lung Tumor (MLT) untreated Pab 55 + 
with CM 101 

25. MLT untreated with CM 101 Pab 55 + 

26. Normal mouse lung Pab 55 

27. MLT untreated with CM 101 Pab 57 + 

28. Npnnal mouse lung Pab 57 

The Pab 55 antibody stains the cells lining a blood vessel in a human ovary 

cancer tissue section, but such staining is not apparent in cells of normal human ovary 
- tissue (see FIG. 2A and 2B, respectively). Similar results are obtained with the Pab 
5 57 antibody (see FIG. 3A and 3B). As shown in the above table and in FIGS. 2A-3B, 
antibodies raised to GBS toxin receptor fragments specifically bound to tumor tissues 
but not normal tissues, suggesting that GBS toxin receptor is expressed in tumor cells 
but not normal cells. 



1 0 Example 5- Detection of GBS Toxin Receptor Expression in Mice 
Afflicted with Rheumatoid Arthritis 

This example shows that GBS toxin receptor can be detected in cells from a 
mammalian model for rheumatoid arthritis (RA). Mice with collagen-induced 
arthritis were treated with CM 101 or carrier. CMlOl reversed the inflammatory 
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damage and inhibited pannus formation. Mouse #8 and #15, which were treated with 
CMlOl, and uvo control mice (not treated with CM 101) were sacrificed for 
immunohistochemistry. 



TABLE 10 

Immunohistochemistry of Rheumatoid Arthritic Mice 

29. No CM 101 Pab55 + 

30. MOUSE 8 - 5' (vessel) Pab 55 + 

31. No CM 101 Pab 57 + 

32. MOUSE 15-5' (vessel) Pab 57 + 

33. MOUSE 8 -5' (between joint) Pab 57 + 

34. MOUSE 15 Pab 57 + 

35. No CM 101 (marrow) Pab 57 -f 

36. MOUSE 15-5' (marrow) Pab 57 + 



As shown above Pab55 and Pab57 specifically bound to pathologic 
neovasculature in the pannus, suggesting that GBS toxin receptor is expressed in mice 
afflicted with rheumatoid arthritis. No binding of CM 101 was observed in the normal 
neovasculature in the growth plate of the joints of the arthritic mice. 

Example 6 - Targeted Delivery of a Chimeric Compound to Tissues 
Expressing GBS Toxin Receptor 

This example shows the targeted delivery of a chimeric compound to tissues 
expressing GBS toxin. The chimeric compound is a CMlOl-biotin conjugate. Mice 
with Madison Lung Tumors (MLT) are infused intravenously (i.v.) with biotinylated 
CM 101. 

CMlOl has been reacted with hydrazinylated biotin to form the biotin 
hydrazone at the reducing end of the polysaccharide CMlOl. Briefly, 25 micrograms 
of lyophilized CMlOl is dissolved in 250 {il labeling buffer at 100 mM sodium 
acetate, 0.02% sodium azide. Aqueous meta-periodate (125 |al of 30 mM) is added 
and the oxidation is allowed to proceed in the dark for 30 minutes at room 
temperature. The reaction is terminated by adding 80 mM Na2S03 to the solution. 
The resultant aldehydes are reacted with 125 ^1 of 5 mM NHS-LC-Biotin (MW 
556.58) for a 1 hour incubation at room temperature to form biotinylated CMlOl . 
Excess biotin is removed by dialysis against 1 liter of PBS at 4°C four times. The 
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product is purified by gel filtration on an Ultrahydrogel 1000 HPLC. lyophilized and 
stored at -70*^C until use. 

Tissues are recovered 5 min post infusion with CM 101 and subjected to 
irftmunohistochemistry. Tumor and normal mouse tissue sections are analyzed for CM 
5 101 binding by both mouse anti-CMlOl mAb (7A3), followed by secondary mAb- 
HRP conjugate (referred to in FIG. 4B as MLT CMIOI-Biot.5' + McAb), or with 
avidin (which specifically binds biotin) conjugated with HRP (referred to in FIG. 4A 
as MLT CMIOI-Biot.5' + Strep.HRP). 

FIGS. 4A-4C depict different sections taken from the same tumor and include 

1 0 a longitudinal view of the same blood vessel approximately in the center of the 

figures. The dark staining in FIG. 4 A shows the localization of the biotin component 
in the cells lining the blood vessel. Similarly, FIG. 4B depicts the localization of the 
CMlOl component in the cells lining the blood vessel. FIG. 4C is a negative control 
that was not exposed to CM 101 . The analysis clearly shows that 7 A3 and avidin bind 

15 to the same blood vessels in tumor tissue. Thus, biotin has been delivered to the 

blood vessel of the tumor tissue by virtue of its physical association with a compound 
(CM 101) that binds the GBS toxin receptor. 

These studies show that chimeric compounds can be delivered to tissues 
undergoing pathologic and/or hypoxia-driven angiogenesis or neovascularization. As 

20 part of a chimeric compound, cytotoxic molecules can be directed to such tissues, e.g., 
tumor tissue. The cytotoxic molecule can be coupled directly to a molecule that binds 
GBS toxin receptor, e.g., GBS toxin. Alternatively, the molecule that binds GBS 
toxin receptor can be coupled to biotin and the cytotoxic molecule can be coupled to 
avidin. 
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Example 7 - Enhanced Sensitivity to GBS-Toxin-Dependent Cytotoxicity 
OF Cells Expressing GBS Toxin Receptor 

This example shows the enhanced sensitivity to GBS-toxin-dependent 
cytotoxicity of cells transfected with the GBS toxin receptor, relative to control cells. 
5 Withput being bound to a particular theory, the inventors believe that complement 
binds GBS toxin bound to the GBS toxin receptor on a cell, thereby targeting the cell 
for killing by white blood cells (WBC). 

Human bladder carcinoma cells (ECV cells), are stable transfected with the 
human GBS toxin receptor gene. The resultant cell line is ECV71 1 . Cells stable 
10 transfected with vector alone as referred to as V23. ECV 71 1 and V23 are seeded in 
96-well plates at 5,000 cells/well. 

White blood cells are collected from healthy human donors as follows. Blood 
is collected by standard phlebotomy procedures into heparinized tubes (30 U/ml) and 
centrifuged at 2000 rpm for 20 min. The interface is carefully transferred to a new 
15 tube and washed twice by centrifiigation with medium (RPMI-1640). Cells are 
resuspended in RPMI-1640 supplemented with 5% fetal bovine serum (FBS) and 
Interferon-gamma (IFN) at 100 U/ml, and incubated overnight in a 37^C, 5%C02 
incubator. The cells are then resuspended in fresh medium with 5% FBS. 

5,000 cells of the WBC preparation are added to each well containing the 
20 transfected cells. CM 101 is added to a final concentration of 1 |ig/ml to the wells 
together with human serum from matching human donors. The cells are incubated 6 
hours at 37 C. 

Cytotoxicity is assayed by measuring lactate dehydrogenase (LDH) using the 
Promega's CytoTox 96 Non-Radioactive Assay kit (Nachlas et al. (1960) Anal. 

25 Biochem 1,317; Korzeniewski et al. ( 1 983) 1 Immunol. Methods 64, 3 1 3 ; Decker et 
al. J. Immunol Methods 1 15, 61; Brander et al. (1993) Eur. J. Immunology 23, 3217; 
Behl et al. (1994) Cell 11, 817; Lappalainen et al. (1994) Pharm. Research 1 1, 1 127; 
Allen et al. (1994) Promega Notes 45, 7; Sinensky et al. (1995) ToxicoL Letters 75, 
02; Moravec (1994) Promega Notes 45, 11). Percent cytotoxicity is calculated as 

30 recommended by the manufacturer's instructions. The results are shown in Table 1 1 . 
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Table 11 



10 



Cytotoxicity 


ECV711 


V23 


WBC, IFN, C3, 
-CMlOl 


29.1% 


27.5% 


WBC, IFN, C3, 
+CM101 


40.45% 


22.46% 



incubated with CM 101, WBC and human serum (source of C3) compared to cells 
incubated without CMlOl . Control cells transfected with vector alone, V23, do not 
show a CMlOl dependent increase in cytotoxicity. 

All publications and patent applications mentioned in this specification are 
herein incorporated by reference to the same extent as if each individual publication 
or patent application was specifically and individually indicated to be incorporated by 
reference. 

The invention now being fully described, it will be apparent to one of ordinary 
skill in the art that many changes and modifications can be made thereto without 
departing from the spirit or scope of the appended claims. 
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1. An isolated polynucleotide at least 10 bases in length comprising a nucleic 
acid sequence encoding, or complementary to a nucleic acid sequence encoding, a 
mammalian receptor for group B p-hemolytic Streptococcus toxin (GBS toxin 

5 receptor), or a polypeptide fragment thereof. 

2. The polynucleotide of Claim 1 , wherein the nucleic acid sequence 
comprises SEQ ID NO: 9. 

3. The polynucleotide of Claim 1, wherein the nucleic acid sequence has 
100% identity to a nucleic acid sequence selected from the group consisting of 

1 0 residues 61 to 1 542 of SEQ ID NO: 1 , residues 266 to 1 870 of SEQ ID NO: 7, and 
residues 87 to 1 568 of SEQ ID NO: 3. 

4. The polynucleotide of Claim 1 , wherein the polynucleotide is hybridizable 
under high stringency conditions to the nucleic acid sequence of SEQ ID NO: 7. 

5. A vector comprising the polynucleotide of Claim 1 . 
15 6. A host cell transformed with the vector of Claim 5. 

7. A process for producing a mammalian GBS toxin receptor or fragment 
thereof, comprising culturing the host cell of Claim 6 in a suitable culture medium. 

8. An isolated polypeptide comprising a mammahan GBS toxin receptor or 
fragment thereof. 

20 9. The polypeptide of Claim 8, wherein the receptor has at least about 86% 

identity to the corresponding amino acid sequence of SEQ ID NO: 2. 

10. The polypeptide of Claim 8, wherein the receptor or fragment has 100% 
identity to the corresponding region of the amino acid sequence of SEQ ID NO: 4 or 
SEQ ID NO: 8. 

25 11. The polypeptide of Claim 8, wherein the polypeptide is encoded by a 

nucleic acid sequence hybridizable under high stringency conditions to a nucleic acid 
sequence selected from the group consisting of: 
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a) nucleotides 61 to 1542 of SEQ ID NO: 1, and 

b) nucleotides 87 to 1568 of SEQ ID NO: 3. 



12. All isolated polypeptide comprising an amino acid sequence that differs 
from an amino acid sequence selected from the group consisting of SEQ ID NO:2, 
5 SEQ ID N0:4 and SEQ ID N0:8 at no more than about 20% of the amino acid 
residues. 



13. The isolated polypeptide of Claim 12, wherein the amino acid sequence of 
said isolated polypeptide differs from the amino acid sequence selected from said 
group by one amino acid residue. 



14, The isolated polypeptide of Claim 12, wherein the different amino acid 
residues are conservative substitutions of the corresponding residues of the amino 
acid sequence selected from said group. 



15. An isolated polypeptide comprising an amino acid sequence of the 
formula: 



15 AAl-AAn-AAm 

wherein: 

AAl is absent or is M; 

AAn is a contiguous chain of 0 to 100 amino acids, preferably of 0 or 41 
amino acids, even more preferably of residues 2-42 of SEQ ID NO: 8; and 

20 AAm is a contiguous chain of 494 amino acids comprising AA43 through 

AA536, wherein: 

(1) each of AA43, AA47, AA51, AA52, AA57, AA58, AA65, AA66, 
AA72, AA85, AA87, AA93, AA94, AA96, AA115, AA116, AA122, AA123, 
AA125, AA134, AA143, AA173, AA174, AA178, AA185, AA186, AA189, AA190 

25 AA196, AA200, AA204, AA206, AA207, AA220, AA253, AA260, AA276 AA277' 
AA280, AA283, AA287, AA294, AA295, AA298, AA300, AA301, AA312 AA324' 
AA326, AA360, AA365, AA373, AA374, AA379, AA396, AA403, AA407i AA418' 
AA480, AA483, AA486, AA491, AA494, AA502, AA528, AA529, AA532 and 
AA536 is an amino acid residue corresponding to: 

30 (a) residue 43, 47, 51, 52, 57, 58, 65, 66, 72, 85, 87, 93, 

94, 96, 115, 116, 122, 123, 125, 134, 143, 173, 174, 178, 185, 186, 189, 190 196 
200, 204, 206, 207, 220, 253, 260, 276, 277, 280, 283, 287, 294, 295, 298, 300, 301 
312, 324, 326, 360, 365, 373, 374, 379, 396, 403, 407, 418, 480, 483, 486, 491 494' 
502, 528, 529, 532 and 536, respectively, of SEQ ID N0:8; 

35 (b) residue 2, 6, 10, 11,16, 17, 24, 25, 31, 44, 46, 52, 

53, 55, 74, 75, 81, 82, 84, 93, 102, 132, 133, 137, 144, 145, 148, 149, 155, 159 163 
165, 166, 179, 212, 219, 235, 236, 239, 242, 246, 253, 254, 257, 259, 260, 27l! 283* 
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285, 319, 324, 332, 333, 338. 355, 362, 366, 377, 439, 442, 445, 450, 453, 461, 487, 
488, 491 and 495, respectively of SEQ ID N0:4; or 

(c) a conservative substitution thereof; 

(2) each of AA44-AA46, AA48-AA50, AA53-AA56, AA59-AA64 
5 AA67-AA71, AA73-AA84, AA86, AA88-AA92, AA95, AA97-AA1 14, AAl 17- ' 

AA121, AA124, AA126-AA133, AA135-AA142, AA144-AA172, AA175-AA177 
AA179-AA184, AA187-AA188, AA19I-AA195, AA197-AA199, AA201-AA203 ' 
AA205, AA208-AA219, AA221-AA252, AA254-AA259, AA261-AA275, AA278- 
AA279, AA281-AA282, AA284-AA286, AA288-AA293, AA296-AA297, AA299 
10 AA302-AA311, AA313.-AA323, AA325, AA327-AA359, AA361-AA364, AA366- 
AA372, AA375-AA378, AA380-AA395, AA397-AA402, AA404-AA406, AA408- 
AA417, AA419-AA478, AA481-AA482, AA484-AA485, AA487-AA490, AA492- 
AA493, AA495-AA501, AA503-AA527, AA530.AA531 and AA533-AA535 is 

(a) residue 44^6, 48-50, 53-56, 59-64, 67-71, 73-84, 
15 86,88-92,95,97-114, 117-121, 124, 126-133, 135-142, 144-172, 175-177, 179-184 

187-188, 191-195, 197-199, 201-203, 205, 208-219, 221-252, 254-259, 261-275, 278- 
279, 281-282, 284-286, 288-293, 296-297, 299, 302-311, 313-323, 325, 327-359, 
361-364, 366-372, 375-378, 380-395, 397-402, 404-406, 408-417, 419-478, 481-482 
484-485, 487-490, 492-493, 495-501, 503-527, 530-531 and 533-535, respectively, of 
20 SEQIDNO:8;or 

(b) a conservative substitutions thereof; and 

(3) one or more of AA3 15 through AA367 are optionally absent. 

16. An antibody that recognizes a mammalian GBS toxin receptor or fragment 

thereof. 



25 17. An isolated complex comprising a GBS toxin bound to a mammalian GBS 

toxin receptor or fragment thereof. 

1 8. A method of forming a complex comprising: 

contacting a GBS toxin with a polypeptide comprising a mammalian 
GBS toxin receptor, or fragment thereof that can bind GBS toxin, under conditions 
30 that permit specific binding of the GBS toxin to the polypeptide, and 
allowing the complex to form. 

19. A method for purifying a compound that binds a GBS toxin receptor, 
which method comprises: 

providing a polypeptide comprising a manmialian GBS toxin receptor 
35 or fragment thereof that binds GBS toxin; 

contacting said polypeptide with a sample comprising the compound 
under conditions that allow specific binding of the compound to the polypeptide; and 
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separating the bound compound from the remainder of the sample. 

20. A method of determining the presence or absence of GBS toxin in a 
sample, which method comprises: 

contacting the sample with a polypeptide comprising a mammalian 
5 GBS toxin receptor, or fragment thereof that can bind GBS toxin, under conditions 
that permit specific binding of the GBS toxin to the polypeptide, 

and determining whether specific binding has occurred. 

21 . A method for diagnosing early onset disease in a neonate comprising 
performing the method of Claim 20, wherein the sample is obtained from the neonate 

1 0 and wherein presence of the GBS toxin is indicative of early onset disease. 

22. A method for detecting pathologic vasculature in a mammalian tissue, 
which method comprises detecting the presence of a GBS toxin receptor. 

23. A method for identifying a compound which inhibits binding of a GBS 
toxin to a mammalian GBS toxin receptor, comprising: 

15 combining a test compound with a polypeptide comprising a 

mammalian GBS toxin receptor, or fragment thereof that can bind GBS toxin, in a 
reaction mixture containing GBS toxin and under conditions that permit specific 
binding of the GBS toxin to the receptor or fragment, and 

determining the amount of inhibition by the compound of the binding 
- 20_ of the GBS toxinjo the polypeptide. 

24. An inhibitor of binding of a GBS toxin to a mammalian GBS toxin 
receptor. 

25. A method for identifying a compound which specifically binds a 
mammalian GBS toxin receptor, comprising: 

25 combining a test compound with a polypeptide comprising a 

mammalian GBS toxin receptor or fragment thereof that can bind GBS toxin, under 
conditions that allow specific binding to occur, and 

detecting a complex formed between said test compound and said 
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26. A method for determining cytotoxicity of a test chimeric compound, which 
rnethod comprises: 

exposing a cell expressing, on the cell surface, a mammalian GBS 
5 toxin receptor, or fragment thereof that binds GBS toxin, to a test chimeric compound 
comprising a cytotoxic agent coupled to said GBS toxin; and 
detecting signs of cytotoxicity, 

27. A chimeric compound comprising a cytotoxic agent covalently linked to a 
molecule that specifically binds a mammalian GBS toxin receptor, 

10 28. A method for identifying an inhibitor of GBS toxin receptor, which 

method comprises: 

incubating test cells in the presence and absence of a test compound 
and under conditions in which the cells incubated in the absence of the test compound 
can proliferate or migrate, wherein the test cells express GBS toxin receptor or a 
1 5 fragment thereof having GBS toxin receptor activity; and 

comparing the proliferation or migration of the test cells incubated in 
the presence of the test compound to the proliferation or migration of the test cells 
incubated in the absence of the test compound, wherein less proliferation or migration 
in the presence of the test compound is indicative of the test compound being an 
20 inhibitor of the GBS toxin receptor. 

29. A method for identifying an inhibitor of endothelial cell proliferation or 
migration, which method comprises: 

incubating test endothehal cells in the presence and absence of a test 
compound and under conditions in which the cells incubated in the absence of the test 
25 compound can proliferate or migrate, wherein the test cells express GBS toxin 
receptor or a fragment thereof having GBS toxin receptor activity; and 

comparing the proliferation or migration of the test cells incubated in 
the presence of the test compound to the proliferation or migration of the test cells 
incubated in the absence of the test compound, wherein less proliferation or migration 
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in the presence of the test compound is indicative of the test compound being an 
inhibitor of the endothelial cell proliferation or migration. 



30. A method for identifying a therapeutic compound for the treatment or 
prevention of a medical condition characterized by pathologic angiogenesis or 
5 neovascularization, which method comprises: 

incubating test cells in the presence and absence of a test compound, 
w^herein the test cells express GBS toxin receptor or a fragment thereof having GBS 
toxin receptor activity; 

comparing the proliferation or migration of the test cells incubated in 
10 the presence of the test compound to the proliferation or migration of the test cells 
incubated in the absence of the test compound, wherein less proliferation or migration 
in the presence of the test compound is indicative of the test compound being a 
candidate therapeutic compound for the treatment or prevention of the medical 
condition. 

15 31. The method of Claim 30, wherein the medical condition is a cancerous 

tumor. 

32. The method of Claim 30, wherein the medical condition is a reperfusion 

injury. 

33. The method of Claim 30, wherein the medical condition is scarring during 
20 wound healing. 

34. The method of Claim 30, wherein the medical condition is keloids. 

35. The method of Claim 30, wherein the medical condition is a chronic 
inflammatory disease. 

36. The method of Claim 30, wherein the medical condition is neural injury. 

25 37. A method for identifying a compound which inhibits binding of a GBS 

toxin to a mammalian GBS toxin receptor, comprising: 

(a) simulating and selecting the most probable conformations of a 
mammalian GBS toxin receptor, 
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(b) designing a chemically modified analog that substantially mimics 
the energetically most probable three-dimensional structure of the polypeptide, 

(c) chemically synthesizing the analog, and 

(d) evaluating the bioactivity of the analog. 

38. A method for identifying a compound which binds to a mammalian GBS 
toxin receptor, comprising: 

(a) simulating and selecting the most probable conformations of a 
mammalian GBS toxin receptor, 

(b) deducing the most probable binding domains of the polypeptide, 

(c) designing a compound that would form the energetically most 
probable complexes with the polypeptide, 

(d) chemically synthesizing the compound, and 

(e) evaluating the bioactivity of the compound. 

39. A method for the prevention or treatment of neonatal onset disease in a 
human neonate, comprising administering an inhibitor of binding of GBS toxin to a 
human GBS toxin receptor. 

40. A method for inhibiting pathologic or hypoxia-driven endothelial cell 
proliferation or migration in a mammalian tissue, which method comprises 
specifically binding a molecule to a GBS toxin receptor present on the surface of at 
least one cell in the tissue, the molecule being selected fi:^om the group consisting of: 

' " ' a compound that can evoke an inflammatory response when bound to a 
GBS toxin receptor in a mammal; 

a chimeric compound comprising a cytotoxic compound coupled to a 
compound that specifically binds the GBS toxin receptor; 

an inhibitor of GBS toxin receptor phosphorylation; and 

an inhibitor of GBS toxin receptor activity. 

41. A pharmaceutical composition comprising a pharmaceutically effective 
amount of a molecule selected firom the group consisting of: 

a GBS toxin receptor or fi-agment thereof; 

68. 



wo 00/05375 PCT/US99/16676 
an inhibitor of a GBS toxin receptor; and 

a chimeric compound comprising a cytotoxic agent coupled to a 
compound that binds GBS toxin receptor, 
and a pharmaceutically acceptable carrier. 

5 

42. A kit comprising a component selected from the group consisting of: 

a GBS toxin receptor or fragment; 

a reagent for detecting the presence of a GBS toxin receptor or 

fragment; and 

10 a reagent for detecting the presence of polynucleotide encoding the 

GBS toxin receptor or fragment. 

43. A molecule for use in a method of treatment of the human or animal body, 
said molecule being selected from the group consisting of: 

a GBS toxin receptor or fragment thereof for use in a method of 
15 treatment of the human or animal body, said molecule being selected from the group 
consisting of: 

a GBS toxin receptor or fragment thereof; 
an inhibitor of binding of GBS toxin to a GBS toxin receptor; 
an inhibitor of a GBS toxin receptor; and 
20 a chimeric compound comprising a cytototoxic agent coupled to a 

compound that binds GBS toxin receptor. 

44. Use of an inhibitor of a GBS toxin receptor, or of an inhibitor of binding of 
GBS toxin to a GBS toxin receptor, for the manufacture of a medicament f^^ 
treatment of a medical condition characterized by pathologic or hypoxia-driven 
25 angiogenesis or neovascularization. 
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SEQUENCE LISTING 

<110> Hellerqvist, Carl 
Fu, Changlin 

<120> GBS Toxin Receptor 

<130> CARB-008/01WO 

<140> 
<141> 

<150> 60-093,843 
<151> 1998-07-22 

<160> 12 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 2602 
<212> ONA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (58) . . (1542) 
<400> 1 

tcgggccggc gctcccttct ctgccaggtg gcgagtacac ctgctcacgt aggcgtc 57 

atg agg tct ccg gtt cga gac ctg gcc egg aac gat ggc gag gag age 105 
Met Arg Ser Pro Val Arg Asp Leu Ala Arg Asn Asp Gly Glu Glu Ser 
1 5 10 15 

acg gac cgc acg cct ctt eta ccg ggc gcc cca egg gcc gaa gee get 153 
Thr Asp Arg Thr Pro Leu Leu Pro Gly Ala Pro Arg Ala Glu Ala Ala 
20 25 30 

cca gtg tgc tgc tct get cgt tac aac tta gca att ttg gee ttt ttt 201 
Pro Val Cys Cys Ser Ala Arg Tyr Asn Leu Ala He Leu Ala Phe Phe 
35 40 45 

ggt ttc ttc att gtg tat gca tta cgt gtg aat ctg agt gtt geg tta 249 
Gly Phe Phe He Val Tyr Ala Leu Arg Val Asn Leu Ser Val Ala Leu 
50 55 60 

gtg gat atg gta gat tea aat aca act tta gaa gat aat aga act tec 297 
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Val Asp Met Val Asp Ser Asn Thr Thr Leu Glu Asp Asn Arg Thr Ser 
65 70 75 80 

aag gcg tgt cca gag cat tct get ccc aca aaa get cat cat aat caa 345 
Lys Ala Cys Pro Glu His Ser Ala Pro He Lys Val His His Asn Gin 
85 90 95 

acg ggt aag aag tac caa tgg gat gca gaa act caa gga tgg att etc 393 
Thr Gly Lys Lys Tyr Gin Trp Asp Ala Glu Thr Gin Gly Trp He Leu 
100 105 110 

ggt tec ttt ttt tat ggc tae ate ate aca cag att cct gga gga tat 441 
Gly Ser Phe Phe Tyr Gly Tyr He He Thr Gin He Pro Gly Gly Tyr 
115 120 125 

gtt gee age aaa ata ggg ggg aaa atg ctg eta gga ttt ggg ate ctt 489 
Val Ala Ser Lys He Gly Gly Lys Met Leu Leu Gly Phe Gly He Leu 
130 135 140 

ggc act get gtc etc acc ctg tte act ccc att get gca gat tta gga 537 
Gly Thr Ala Val Leu Thr Leu Phe Thr Pro He Ala Ala Asp Leu Gly 
145 150 155 160 

gtt gga cca etc att gta etc aga gca eta gaa gga eta gga gag ggt 585 
Val Gly Pro Leu He Val heu Arg Ala Leu Glu Gly Leu Gly Glu Gly 
165 170 175 

gtt aca ttt cca gee atg cat gee atg tgg tct tct tgg get cee cct 633 
Val Thr Phe Pro Ala Met His Ala Met Trp Ser Ser Trp Ala Pro Pro 
180 185 190 

ctt gaa aga age aaa ctt ctt age att teg tat gca gga gca cag ctt 681 
Leu Glu Arg Ser Lys Leu I#eu Ser He Ser Tyr Ala Gly Ala Gin Leu 
195 200 205 

ggg aca gta att tct ctt cct ctt tct gga ata att tgc tac tat atg 729 
Gly Thr Val He Ser Leu Pro Leu Ser Gly He He Cys Tyr Tyr Met 
210 215 220 

aat tgg act tat gtc ttc tac ttt ttt ggt act att gga ata ttt tgg 777 
Asn Trp Thr Tyr Val Phe Tyr Phe Phe Gly Thr He Gly He Phe Trp 
225 230 235 240 

ttt ctt ttg tgg ate tgg tta gtt agt gac aca cca caa aaa cac aag 825 
Phe Leu Leu Trp He Trp Leu Val Ser Asp Thr Pro Gin Lys His Lys 
245 250 2S5 

aga att tec cat tat gaa aag gaa tac att ctt tea tea tta aga aat 873 
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Arg lie Ser His Tyr Glu Lys Glu Tyr lie Leu Ser Ser Leu Arg Asn 
260 265 270 

cag ctt tct tea cag aag tea gtg ccg tgg gta ccc att tta aaa tec 921 
Gin L-eu Ser Ser Gin Lys Ser Val Pro Trp Val Pro lie Leu Lys Ser 
275 280 285 

ctg cca ctt tgg get ate gta gtt gca cae ttt tct tac aac tgg act 969 
Leu Pro Leu Trp Ala lie Val Val Ala His Phe Ser Tyr Asn Trp Thr 
290 295 300 

ttt tat act tta ttg aca tta ttg cct act tat atg aag gag ate eta 1017 
Phe Tyr Thr Leu Leu Thr Leu Leu Pro Thr Tyr Met Lys Glu He Leu 
305 310 315 320 

agg ttc aat gtt caa gag aat ggg ttt tta tct tea ttg cct tat tta 1065 
Arg Phe Asn Val Gin Glu Asn Gly Phe Leu Ser Ser Leu Pro Tyr Leu 
325 330 335 

gge tct tgg tta tgt atg ate ctg tct ggt caa get get gac aat tta 1113 
Gly Ser Trp Leu Cys Met He Leu Ser Gly Gin Ala Ala Asp Asn Leu 
340 345 350 

agg gca aaa tgg aat ttt tea act tta tgt gtt cgc aga att ttt age 1161 
Arg Ala Lys Trp Asn Phe Ser Thr Leu Cys Val Arg Arg He Phe Ser 
355 360 365 

ctt ata gga atg att gga cct gca gta ttc ctg gta get get ggc ttc 1209 
Leu He Gly Met He Gly Pro Ala Val Phe Leu Val Ala Ala Gly Phe 
370 375 380 

att ggc tgt gat tat tct ttg gcc gtt get ttc eta act ata tea aca 1257 
He Gly Cys Asp Tyr Ser Leu Ala Val Ala Phe Leu Thr He Ser Thr 
385 390 395 400 

aca ctg gga ggc ttt tgc tct tct gga ttt age ate aac cat ctg gat 1305 
Thr Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Asp 
405 410 415 

att get cct teg tat get ggt ate etc ctg ggc ate aca aat aca ttt 1353 
He Ala Pro Ser Tyr Ala Gly He Leu Leu Gly He Thr Asn Thr Phe 
420 425 430 

gee act att cca gga atg gtc ggg ccc gtc att get aaa agt ctg acc 1401 
Ala Thr He Pro Gly Met Val Gly Pro Val He Ala Lys Ser Leu Thr 
435 440 445 

cct gat aac act gtt gga gaa tgg caa acc gtg ttc tat att get get 1449 
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Pro Asp Asn Thr Val Gly Glu Trp Gin Thr Val Phe Tyr lie Ala Ala 
450 455 460 

gcc act aac gcc cct ggt gcc act ttc ttt aca eta tec gcc aaa ggc 1497 
Ala He Asn Val Phe Gly Ala He Phe Phe Thr Leu Phe Ala Lys Gly 

470 475 



gaa gta caa aac tgg get etc aat gat cac cat gga cac aga cac 
Glu Val Gin Asn Trp Ala Leu Asn Asp His His Gly His Arg His 
485 490 455 



1542 



tgaaggaacc aataaataat cctgcctcta ttaatgtatt tttatttatc atgtaacctc 1602 
aaagtgcctt ctgtattgtg taagcattct atgtcttttt ttaattgtac ttgtattaga 1662 
tttttaaggc ctataatcac gaaatatcac tagttgccag aataataaaa tgaactgtgt 1722 
ttaattatga ataatatgta agctaggact tctactttag gttcacatac ctgcctgcta 1782 
gtcgggcaac acgaagcagg acagctctgt tgatttttta gggccatact aaagggaatg 1842 
agctgaaaca gacctcctga tacctttgct taattaaact agatgataat tctcaggtac 1902 
tgataaacac ctgttgttgt tcactttcct cataaaaatt gtcagctctc tctgacactt 1962 
agacctcaaa ctttagcatc tctgtggagc tgccatccac tgtataattt cgcctggcaa 2022 
ctggactgag gggagtgtgc ccaggcagct gccaagcact ccccccctgg cttcagggtc 2082 
agagtgccca gcgtttatca gaggcagcat ccaagcccag agccagtgtc gactcttcgg 2142 
ctggtgcctt tccectgagg ggctatcaat gtgtagataa agccctgagt aggcaagagc 2202 
agtgagatcc actgctatgg tcttgataca tcctcaaact ttcccttccc agcacagagg 2262 
aatattggct ggcatgcaac ctgcaaaaga aaaatgcgaa gcggccgggc acggtggctc 2322 
atgcctgtaa tcccagcact ttggggggct gaggtgggcg aatcatgaga tcaggagttc 2382 
gagaccagcc tggccagcat ggtgaaaccc catctctact aaaaacacaa aaaattagct 2442 
gggcgtggtg acgggcgcct gtaatcccag atactcagga ggctgaggta ggagaatcac 2502 
ttgaacctgg gaggcggaag ttgcagtgaa ccaagatcac gccactgcac tecagcctgg 2562 
gcgatggagc gagactccaa ctcaaaaaaa aaaaaaaaaa 2602 



<WO_OOOS37SA1J.> 



wo 00/05375 



PCT/US99/16676 



<210> 2 
<211> 495 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Arg Ser Pro Val Arg Asp Leu Ala Arg Asn Asp Gly Glu Glu Ser 
15 10 15 

Thr Asp Arg Thr Pro Leu Leu Pro Gly Ala Pro Arg Ala Glu Ala Ala 
20 25 30 

Pro Val Cys Cys Ser Ala Arg Tyr Asn Leu Ala He Leu Ala Phe Phe 
35 40 45 

Gly Phe Phe He Val Tyr Ala Leu Arg Val Asn Leu Ser Val Ala Leu 
50 55 60 

Val Asp Met Val Asp Ser Asn Thr Thr Leu Glu Asp Asn Arg Thr Ser 
65 70 75 80 

Lys Ala Cys Pro Glu His Ser Ala Pro He Lys Val His His Asn Gin 
85 90 95 

Thr Gly Lys Lys Tyr Gin Trp Asp Ala Glu Thr Gin Gly Trp He Leu 
100 105 110 

Gly Ser Phe Phe Tyr Gly Tyr He He Thr Gin He Pro Gly Gly Tyr 
115 120 125 

Val Ala Ser Lys He Gly Gly Lys Met Leu Leu Gly Phe Gly He Lieu 
130 135 140 

Gly Thr Ala Val Leu Thr Leu Phe Thr Pro He Ala Ala Asp Leu Gly 

145 . . _.150 . . 155 _ . _ ,160 

Val Gly Pro Leu He Val Leu Arg Ala Leu Glu Gly Leu Gly Glu Gly 
165 170 175 

Val Thr Phe Pro Ala Met His Ala Met Trp Ser Ser Trp Ala Pro Pro 
180 185 190 

Leu Glu Arg Ser Lys Leu Leu Ser He Ser Tyr Ala Gly Ala Gin Leu 
195 200 205 

Gly Thr Val He Ser Leu Pro Leu Ser Gly He He Cys Tyr Tyr Met 
210 215 220 
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Asn Trp Thr Tyr Val Phe T\'r Phe Phe Gly Thr lie Gly lie Phe Trp 
225 230 235 240 

Phe Leu Leu Trp He Trp Leu Val Ser Asp Thr Pro Gin Lys His Lys 
245 250 255 



Arg He Ser His Tyr Glu Lys Glu 
260 

Gin Leu Ser Ser Gin Lys Ser Val 
275 280 

Leu Pro Leu Trp Ala He Val Val 
290 295 

Phe Tyr Thr Leu Leu Thr Leu Leu 
305 310 



Tyr lie Leu Ser Ser Leu Arg Asn 
265 270 

Pro Trp Val Pro He Leu Lys Ser 
285 

Ala His Phe Ser Tyr Asn Trp Thr 
300 

Pro Thr Tyr Met Lys Glu He Leu 
315 320 



Arg Phe Asn Val Gin Glu Asn Gly 
325 

Gly Ser Trp Leu Cys Met He Leu 
340 

Arg Ala Lys Trp Asn Phe Ser Thr 
355 360 

l^u He Gly Met He Gly Pro Ala 
370 375 

He Gly Cys Asp Tyr Ser Leu Ala 
385 390 



Phe Leu Ser Ser Leu Pro Tyr Leu 
330 335 

Ser Gly Gin Ala Ala Asp Asn Leu 
345 350 

Leu Cys Val Arg Arg He Phe Ser 
365 

Val Phe Leu Val Ala Ala Gly Phe 
380 

Val Ala Phe Leu Thr He Ser Thr 
395 400 



Thr Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Asp 

405 - - - - ^ _4i5_ _ 

He Ala Pro Ser Tyr Ala Gly He Leu Leu Gly He Thr Asn Thr Phe 
420 425 430 



Ala Thr He Pro Gly Met Val Gly Pro Val He Ala Lys Ser Leu Thr 
435 440 445 

Pro Asp Asn Thr Val Gly Glu Trp Gin Thr Val Phe Tyr He Ala Ala 
450 455 460 

Ala He Asn Val Phe Gly Ala He Phe Phe Thr Leu Phe Ala Lys Gly 
465 470 475 480 
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Glu Val Gin Asn Trp Ala Leu Asn Asp His His Gly His Arg His 
485 490 



<210_> 3 
<211> 2844 
<212> DNA 
<213> Ovis sp. 

<220> 
<221> CDS 

<222> (84), . . (1568) 
<400> 3 

cccgggggcg gggggcttcg gcggtcccgc tggagctctc ttttccgcgg agcaggtttg 60 

cgccgtagct ccctgaaggc ate atg aag tec ccg gtt teg gae tta gee ecg 113 

Met Lys Ser Pro Val Ser Asp Leu Ala Pro 

age gae ggc gag gag gge teg gae ego aca ccg etc ctg cag cge gcc 161 
Ser Asp Gly Glu Glu Gly Ser Asp Arg Thr Pro Leu Leu Gin Arg Ala 
15 20 25" 

ccg egg gcg gaa cce get eca gta tgc tge tct get egt tac aac eta 209 
Pro Arg Ala Glu Pro Ala Pro Val Cys Cys Ser Ala Arg Tyr Asn Leu 
30 35 40 

gea ttt ttg tec ttt ttt ggt tte tte gtt etc tat tea tta egg gtg 257 
Ala Phe Leu Ser Phe Phe Gly Phe Phe Val Leu Tyr Ser Leu Arg Val 
45 50 55 



aat ctg age gtt gea eta gtg gae atg gtg gat tea aae aca act gee 
Asn Leu Ser Val Ala Leu Val Asp Met Val Asp Ser Asn Thr Thr Ala 
60 65 70 

aaa gat aat aga acg tec tac gag tgt gea gag cat tet get ccc ata 
Lys Asp Asn Arg Thr Ser Tyr Glu Cys Ala Glu His Ser Ala Pro He 
75 80 85 90 

aaa gtt ett cac aac caa acg ggt aaa aag tac egg tgg gat gea gaa 
Lys Val Leu His Asn Gin Thr Gly Lys Lys Tyr Arg Trp Asp Ala Glu 
95 . 100 105 

act caa gga tgg att etc gga tct ttt tte tat ggc tac ate ate aca 
Thr Gin Gly Trp He Leu Gly Ser Phe Phe Tyr Gly Tyr He He Thr 
110 115 120 



305 



353 



401 



449 
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caa ate cct gga gga tac gtz gcc age aga agt ggg ggg aag ccg etc 497 
Gin lie Pro Gly Gly Tyr Val Ala Ser Arg Ser Gly Gly Lys Leu Leu 
125 130 135 

eta gga ttc ggg ate ttt get aca get ate ttc ace etg ttc act ccc 545 
Leu Gly Phe Gly He Phe Ala Thr Ala He Phe Thr Leu Phe Thr Pro 
140 145 150 

etc get gca gat ttc gga gte gga gcc ctt gtt gca etc agg gea eta 593 
Leu Ala Ala Asp Phe Gly Val Gly Ala Leu Val Ala Leu Arg Ala Leu 
155 160 165 170 

gaa ggg eta gga gag ggt gte aca tat cca gcc atg cat gcc atg tgg 641 
Glu Gly Leu Gly Glu Gly Val Thr Tyr Pro Ala Met His Ala Met Trp 
175 180 185 

net tea tgg get ccc cct ctt gaa aga age aag ctt etg agt att tea 669 
Ser Ser Trp Ala Pro Pro Leu Glu Arg Ser Lys Leu Leu Ser He Ser 
190 195 200 

tat gca gga gca caa ctt ggg aca gta gtt tct ctt cct ctt tct gga 737 
Tyr Ala Gly Ala Gin Leu Gly Thr Val Val Ser Leu Pro Leu Ser Gly 
205 210 215 

gta att tgc tac tat atg aat tgg act tat gte ttc tat ttc ttt ggc 785 
Val lie Cys Tyr Tyr Met Asn Trp Thr Tyr Val Phe Tyr Phe Phe Gly 
220 225 230 

att gtt gga ate ate tgg ttt att tta tgg ate tgc tta gtt agt gat 833 
He Val Gly He He Trp Phe He Leu Trp He Cys Leu Val Ser Asp 
235 240 245 250 

aca cca gaa act cac aag aca ate act ccc tat gaa aag gag tat att 881 
Thr Pro Glu Thr His Lys Thr He Thr Pro Tyr Glu Lys Glu Tyr He 
255 260 265 

ctt tea tea tta aaa aat cag etc tct tea cag aag tea gtg ccg tgg 929 
Leu Ser Ser Leu Lys Asn Gin Leu Ser Ser Gin Lys Ser Val Pro Trp 
270 275 280 

ata cct atg etg aaa tea etg eca ctt tgg get att gte gtt gca cat 977 
He Pro Met Leu Lys Ser Leu Pro Leu Trp Ala He Val Val Ala His 
285 290 295 

ttt tct tac aae tgg act ttt tat act ttg ttg ace tta ttg cct act 1025 
Phe Ser Tyr Asn Trp Thr Phe Tyr Thr Leu Leu Thr Leu Leu Pro Thr 
300 305 310 
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cac atg aag gaa gcc eta agg etc aac ate caa gag aat ggg ttt tta 1073 
Tyr Met Lys Giu Val Leu Arg Phe Asn lie Gin Glu Asn Gly Phe Leu 
315 320 325 330 



tct gca gtc cct tat tta ggt tgt tgg 
Ser Ala Val Pro Tyr Leu Gly Cys Trp 
335 



tta tgt atg ate ctg teg ggt 1121 
Leu cys Met lie Leu Ser Gly 
340 345 



caa get get gac aat tta agg gca aga tgg aat ttt tea act ctg tgg 1169 
Gin Ala Ala Asp . Asn Leu Arg Ala Arg Trp Asn Phe Ser Thr Leu Trp 
350 355 360 



gtt cga aga gtt ttt age ctt ata ggg atg att gga cct gcg ata ttc 1217 
Val Arg Arg Val Phe Ser Leu He Gly Met He Gly Pro Ala He Phe 
365 370 375 



ctg gtt gcc gca gga ttt ata ggc tgt gat tat tec ttg get gtt gca 1265 
Leu Val Ala Ala Gly Phe He Gly Cys Asp Tyr Ser Leu Ala Val Ala 
380 ^ 385 390 



ttc eta acc ata tea aca acc ctg gga 
Phe Leu Thr He Ser Thr Thr Leu Gly 
395 400 

age ate aac cat ctg gac att get cct 
Ser He Asn His Leu Asp He Ala Pro 
415 

ggc ate aca aat acc ttt gcc act att 
Gly He Thr Asn Thr Phe Ala Thr He 
430 435 



ggc ttt tgc tct tct gga ttt 1313 
Gly Phe Cys Ser Ser Gly Phe 
405 410 

teg tat get ggt att etc ctg 1361 
Ser Tyr Ala Gly He Leu Leu 
420 425 

cct gga atg att ggg ccc ate 1409 
Pro Gly Met He Gly Pro He 
440 



att gee aga agt ctt acc cct gag aac act att gga gaa tgg caa act 1457 
He Ala Arg Ser Leu Thr Pro Glu Asn Thr He Gly Glu Trp Gin Thr 
445 ^ 450 _ . 455 

gtt ttc tgc ate get get get ate aat gta ttt ggt gcc att ttc ttc 1505 
Val Phe Cys He Ala Ala Ala He Asn Val Phe Gly Ala He Phe Phe 
460 465 470 



aca eta ttc gcc aaa ggt gaa gtg caa aac tgg gcc ate agt gat cac 1553 
Thr Leu Phe Ala Lys Gly Glu Val Gin Asn Trp Ala He Ser Asp His 
475 480 485 490 

caa gga cac aga aac tgaaggaacc aataaataat cetgtctcta ttaatgtatc 1608 
Gin Gly His Arg Asn 
495 
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tctgrccacc acgLaaccta aaagcgcctr cgacatttca acgtgtaagc aatccacaca 1668 
caagacaaaa ctgtactaga aaaatcgtgc cagatttgta aggcttgtaa tcatgaaatg 1728 
tcaccagttg ccatataagc aaaactagct atttttaatt attattaacc cgtctgctgg 1788 
aacttacaat ccagggtcac atatctggct gcaagtcagg caacccacaa caggggagtt 1848 
ctatttattt ataagaccat acccaaagag acgagctgaa atagaccctt ctataccttt 1908 
gcttaattaa ggtggataat aattctcagg tcttgttaaa catctgtttt tgtacacctt 1968 
cctcaaaaaa ttatttgtca tcagcaatcc ctgacatgta ggtctcaaac tttagcctct 2028 
ccacggagct ggcagccact gtatcattca gcctggcaac ttcactgagg gaagcatgcc 2088 
caggcagctg ccacatgtcc cctctctggc ttcagggaca gtgcccagca ctcaggcagc 2148 
atccaagacc agggtcagcg ccaaggcttt ggacggtatt cttcccctgg ggctgttaat 2208 
gtgtggatga agccctgagc caacagggac agcgcgatcc acagtcatgg tttccatgca 2268 
ccctctccct tcccttccca gcacactgga gtattgcctg gcatgtaacc tgcaaaagaa 2328 
agtigtgatgc ctaattagcc acatataaca tcatccttga tgatcctacc ttcacatgga 2388 
tcagagtata aatcttcaag tcctgtgttc taggagctac accagaataa ttaaaatata 2448 
aaaagaaaca aaacattttt tctgtctgac acctaagcgt ctggttgcag ttcaaggtta 2508 
aagtgacttc tacttcacat aacctgcaac cggtggtgta atcatcttta gtgttggttt 2568 
cttaaacctt atttttccag tttttcctgg accatcttcc agtggttttg agcatgcttt 2628 
gagggcattt atgtgattta gaacttgatt aatgtttcac tgtgtatgtt caacactacc 2688 
tgcaatattt taactaaagc tatttaatgt aatatgatgt gtatacattc tgtaaattaa 2746 
tttttaaatc tgtaaatagc tttaagttgc tatggtgata tttcttttac aaatcaaaat 2808 
aaaccttttt ggaacgataa aaaaaaaaaa aaaaaa 2844 

<210> 4 ^ 
<211> 495 
<212> PRT 
<213> Ovis sp. 
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<400> 4 

Met Lys Ser Pro Val Ser Asp Leu Ala Pro Ser Asp Gly Glu Glu Gly 
15 10 15 

Ser Asp Arg Thr Pro Leu Leu Gin Arg Ala Pro Arg Ala Glu Pro Ala 
20 25 30 

Pro Val Cys Cys Ser Ala Arg Tyr Asn Leu Ala Phe Leu Ser Phe Phe 
35 40 45 

Gly Phe Phe Val Leu Tyr Ser Leu Arg Val Asn Leu Ser Val Ala Leu 
50 55 60 

Val Asp Met Val Asp Ser Asn Thr Thr Ala Lys Asp Asn Arg Thr Ser 
€5 70 75 80 

Tyr Glu Cys Ala Glu His Ser Ala Pro He Lys Val Leu His Asn Qln 
85 90 95 

Thr Gly Lys Lys Tyr Arg Trp Asp Ala Glu Thr Gin Gly Trp He Leu 
100 105 110 

Gly Ser Phe Phe Tyr Gly Tyr He He Thr Gin He Pro Gly Gly Tyr 
115 ' 120 125 

Val Ala Ser Arg Ser Gly Gly Lys Leu Leu Leu Gly Phe Gly He Phe 
130 135 140 

Ala Thr Ala He Phe Thr Leu Phe Thr Pro Leu Ala Ala Asp Phe Gly 
145 150 155 160 

Val Gly Ala Leu Val Ala Leu Arg Ala Leu Glu Gly Leu Gly Glu Gly 
165 170 175 

Val Thr Tyr Pro Ala Met His Ala Met Trp Ser Ser Trp Ala Pro Pro 
180 185 190 

Leu Glu Arg Ser Lys Leu Leu Ser He Ser Tyr Ala Gly Ala Gin Leu 
155 200 205 

Gly Thr Val Val Ser Leu Pro Leu Ser Gly Val He Cys Tyr Tyr Met 
210 215 220 

Asn Trp Thr Tyr Val Phe Tyr Phe Phe Gly He Val Gly He He Trp 
225 230 235 240 

Phe He Leu Trp He Cys Leu Val Ser Asp Thr Pro Glu Thr His Lys 
245 250 255 
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Thr lie Thr Pro Tyr Glu Lys Glu Tyr He Leu Ser Ser Leu Lys Asn 
260 265 



270 



Gin Leu Ser Ser Gin Lys Ser Val Pro Trp He Pro Met Leu Lys Ser 
275 280 



285 



Leu Pro Leu Trp Ala He Val Val Ala His Phe Ser Tyr Asn Trp Thr 
290 295 



300 



Phe Tyr Thr Leu Leu Thr Leu Leu Pro Thr Tyr Met Lys Glu Val Leu 

315 320 



Arg Phe Asn He Gin Glu Asn Gly Phe Leu Ser Ala Val Pro Tyr Leu 
325 330 



335 



Gly Cys Trp Leu Cys Met He Leu Ser Gly Gin Ala Ala Asp Asn Leu 
340 345 350 

Arg Ala Arg Trp Asn Phe Ser Thr Leu Trp Val Arg Arg Val Phe Ser 
355 360 

Leu He Gly Met He Gly Pro Ala He Phe Leu Val Ala Ala Gly Phe 
370 375 380 

He Gly Cys Asp Tyr Ser Leu Ala Val Ala Phe Leu Thr He Ser Thr 

390 395 400 

Thr Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Asp 
405 410 415 

He Ala Pro Ser Tyr Ala Gly He Leu Leu Gly He Thr Asn Thr Phe 
420 425 430 

Ala. Thr He Pro Gly Met He Gly Pro He He Ala Arg Ser Leu Thr- 
435 440 445 

Pro Glu Asn Thr He Gly Glu Trp Gin Thr Val Phe Cys He Ala Ala 
450 ' 455 460 



Ala He Asn Val Phe Gly Ala He Phe Phe Thr Leu Phe Ala i;ys Gly 
465 470 ' 475 



480 



Glu Val Gin Asn Trp Ala He Ser Asp His Gin Gly His Arg Asn 
485 490 495 



<210> 5 
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<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 5 

cgggatcccg ccngcnatgc ayrshrtstg g ^. 



<210> 6 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 6 

ggaattccdg gdgcratktc narrtrrtt 29 

<210> 7 
<211> 2930 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (263) . . (1870) 
<400> 7 

gttcggtcga agccctcccc ttaattatgt gcaattcaag tccccactgc ccgcccgcaa 60 

gcccccactc atcctcgctg cgggcagggt ggcccctgca ctttacaagg gggtgcagga 120 

gcgggagacg gtcgtccgaa caccggctcc ccggcatgcg tagaccggcg ggcggagcgg 180 

gctcactttg cgccaatcct acgagaactc ccagaactcc gcttccctag tccaacccaa 240 

gccagagttg cccacaccta ag atg gcg gcg ggg gcg atg aca ccg ccc cgc 292 

Met Ala Ala Gly Ala Met Thr Pro Pro Arg 
1 5 10 

ccg gtc cag cca get egg ccc ggg ggc ttc ggg etg teg ggc egg cgc 340 
Pro Val Gin Pro Ala Arg Pro Gly Gly Phe Gly Leu Ser Gly Arg Arg 
15 20 25 
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tec etc etc tgc cag gtg gcg age aca cct gcc cac gca ggc gcc atg 388 
Ser Leu Leu Cys Gin Val Ala Ser Thr Pro Ala His Val Gly Val Met 
• 3° 35 40 

agg tct ccg gtt cga gac ctg gcc egg aac gat ggc gag gag age aeg 436 
Arg Ser Pro Val Arg Asp Leu Ala Arg Asn Asp Gly Glu Glu Ser Thr 
45 50 55 

gac cgc aeg cct ett eta ccg ggc gcc cca egg gee gaa gcc get eca 484 
Asp Arg Thr Pro Leu Leu Pro Gly Ala Pro Arg Ala Glu Ala Ala Pro 
60 65 70 

gtg tgc tgc tct get cgt tac aac tta gca att ttg gcc ttt ttt ggt 532 

Val Cys Cys Ser Ala Arg Tyr Asn Leu Ala lie Leu Ala Phe Phe Gly 

75 80 fl«; 

85 30 



ttc ttc att gtg tat gca tta cgt gtg aat ctg agt gtt gcg tta gtg 
Phe Phe lie Val Tyr Ala Leu Arg Val Asn Leu Ser Val Ala Leu Val 
95 100 

gat atg gta gat tea aat aca act tta gaa gat aat aga act tec aag 
Asp Met Val Asp Ser Asn Thr Thr Leu Glu Asp Asn Arg Thr Ser Lys 
"° 1X5 120 

gcg tgt cca gag cat tct get ccc ata aaa gtt cat cat aat caa aeg 
Ala Cys Pro Glu His Ser Ala Pro He Lys Val His His Asn Gin Thr 
125 130 135 

ggt aag aag tac caa tgg gat gca gaa act caa gga tgg att etc ggt 
Gly Lys Lys Tyr Gin Trp Asp Ala Glu Thr Gin Gly Trp He Leu Gly 
"0 145 ISO 



580 



628 



676 



724 



tec ttt ttt tat ggc tac ate ate aca cag att cct gga gga tat gtt 772 
Ser Phe Phe Tyr Gly Tyr He He Thr Gin He Pro Gly Gly Tyr Val 
"5 160 165 170 

gcc age aaa aCa ggg ggg aaa atg ctg eta gga ttt ggg ate ett ggc 820 
Ala Ser Lys lie Gly Gly Lys Met Leu Leu Gly Phe Gly He Leu Gly 
^■'S 180 185 

act get gte etc ace ctg ttc act ccc att get gca gat tta gga gtt 868 
Thr Ala Val Leu Thr Leu Phe Thr Pro He Ala Ala Asp Leu Gly Val 
190 195 200 

gga cca etc att gta etc aga gca eta gaa gga eta gga gag ggt gtt 9i6 
Gly Pro Leu He Val Leu Arg Ala Leu Glu Gly Leu Gly Glu Gly Val 
205 210 215 
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aca ttz cca gcc atg cat gcc acg egg tec tct tgg get ccc ccc ctt 964 
Thr Phe Pro Ala Met His Ala Met Trp Ser Ser Trp Ala Pro Pro Leu 
220 225 230 

gaa aga age aaa ctt ctt age att teg tat gea gga gca cag ctt ggg 1012 
Glu Arg Ser Lys Leu Leu Ser lie Ser Tyr Ala Gly Ala Gin Leu Gly 
235 240 245 250 

aca gta att tct ctt cct ctt tct gga ata att tgc tac tat atg aat 106O 
Thr Val He Ser Leu Pro Leu Ser Gly He He Cys Tyr Tyr Met Asn 
255 260 265 



tgg act tat gtc ttc tac ttt ttt ggt act att gga ata ttt tgg ttt 1108 
Trp Thr Tyr Val Phe Tyr Phe Phe Gly Thr He Gly He Phe Trp Phe 
270 275 280 



ctt ttg tgg ate tgg tta gtt agt gac aca cca caa aaa cac aag aga 1156 
Leu Leu Trp He Trp Leu Val Ser Asp Thr Pro Gin Lys His Lys Arg 
285 290 295 



att tec cat tat gaa aag gaa tac att ctt tea tea tta aga aat cag 1204 
He Ser His Tyr Glu Lys Glu Tyr He Leu Ser Ser Leu Arg Asn Gin 
300 305 310 

ctt tct tea cag aag tea gtg ecg tgg gta eec att tta aaa tee ctg 1252 
Leu Ser Ser Gin Lys Ser Val Pro Trp Val Pro He Leu Lys Ser Leu 
315 320 325 330 

cca ctt tgg get ate gta gtt gca eac ttt tct tac aac tgg act ttt 1300 
Pro Leu Trp Ala He Val Val Ala His Phe Ser Tyr Asn Trp Thr Phe 
335 340 345 



tat act tta ttg aca tta ttg cet act tat atg aag gag ate eta agg 1343 
Tyr Thr Leu I*eu Thr Leu Leu Pro Thr Tyr Met Lys Glu He Leu Arg 
350 355 360 

tte aat gtt caa gag aat ggg ttt tta tct tea ttg cet tat tta gge 1396 
Phe Asn Val Gin Glu Asn Gly Phe Leu Ser Ser Leu Pro Tyr Leu Gly 
365 370 375 



tct tgg tta tgt atg ate ctg tct ggt caa get get gae aat tta agg 1444 
Ser Trp Leu Cys Met He Leu Ser Gly Gin Ala Ala Asp Asn Leu Arg 
380 385 390 

gca aaa tgg aat ttt tea act tta tgt gtt cge aga att ttt age ctt 1492 
Ala Lys Trp Asn Phe Ser Thr Leu Cys Val Arg Arg He Phe Ser Leu 
395 400 405 410 
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ata gga acg ate gga cct gca gca etc ctg gca get get ggc ttc att 1540 
He Gly Met He Gly Pro Ala Val Phe Leu Val Ala Ala Gly Phe He 
415 '420 425 



ggc tgt gat tat tct ttg gcc gtt get ttc eta act ata tea aca aca 
Gly Cys Asp Tyr Ser Leu Ala Val Ala Phe Leu Thr He Ser Thr Thr 
430 435 440 



1588 



ctg gga gge ttt tgc tet tet gga ttt age ate aae cat ctg gat att 1636 
Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Asp He 
445 450 455 

get cct teg tat get ggt ate etc ctg ggc ate aca aat aca ttt gcc 1684 
Ala Pro Ser Tyr Ala Gly He Leu Leu Gly He Thr Asn Thr Phe Ala 
460 465 470 

act att cea gga atg gtt ggg ccc gtc att get aaa agt ctg ace cct 1732 
Thr He Pro Gly Met Val Gly Pro Val He Ala Lys Ser Leu Thr Pro 
475 480 485 490 

gat aac act gtt gga gaa tgg caa acc gtg ttc tat att get get get 1780 
Asp Asn Thr Val Gly Glu Trp Gin Thr Val Phe Tyr He Ala Ala Ala 
495 ' 500 505 

att aat gtt ttt ggt gcc att ttc ttt aca eta ttc gcc aaa ggt gaa 1828 
He Asn Val Phe Gly Ala He Phe Phe Thr Leu Phe Ala Lys Gly Glu 
510 515 520 



gta caa aac tgg get etc aat gat cac cat gga cac aga cac 
Val Gin Asn Trp Ala Leu Asn Asp His His Gly His Arg His 
525 530 535 



1870 



tgaaggaacc aataaataat cctgcctcta ttaatgtatt tttatttatc atgtaacctc 1930 
aaagtgcctt ctgtattgtg taagcattct atgtcttttt ttaattgtac ttgtattaga 1990 
tttttaaggc ctataatcat gaaatatcac tagttgccag aataataaaa tgaactgtgt 2050 
ttaattatga ataatatgta agctaggact tctactttag gttcacatac ctgcctgcta 2110 
gtcgggcaac atgaagtagg acagttctgt tgatttttta gggccatact aaagggaatg 2170 
agctgaaaca gacctcctga tacctttgct taattaaaet agatgataat tctcaggtac 2230 
tgataaacac ctgttgttgt tcactttcct cataaaaatt gtcagctctc tctgacactt 2290 
agacctcaaa ctttagcatc tctgtggagc tgccatccac tgtataattt cgcctggcaa 2350 
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ctggaccgag gggagtgtgc ccaggcagct gccaagcact ccctccccgg ctccagggcc 2410 
agagtgccca gcgtttatca gaggcagcat ccaagcccag agccagtgtc gactcttcgg 2470 
ccggtgcctt tcctctgagg ggctatcaat gtgtagataa agccctgagt aggcaagagc 2530 
agtgagatcc actgctatgg tcttgataca tccccaaact ttcccttccc agcacagagg 2590 
aatattggct ggcatgcaac ctgcaaaaga aaaacgcgaa gcggccgggc acggtggctc 2650 
acgcctgcaa tcccagcact ttggggggct gaggtgggcg aatcatgaga tcaggagttc 2710 
gagaccagcc tggccagcat ggtgaaaccc catctctact aaaaatacaa aaaattagct 2770 
gggcgtggtg acgggcgcct gtaatcccag atactcagga ggctgaggta ggagaatcac 2830 
ttgaacctgg gaggtggaag ttgcagtgaa ccaagatcac gccactgcac tccagcctgg 2890 
gcgatggagc gagactccaa ctcaaaaaaa aaaaaaaaaa 2930 



<210> 8 
<211> 536 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Ala Ala Gly Ala Met Thr Pro Pro Arg Pro Val Gin Pro Ala Arg 
15 10 15 

Pro Gly Gly Phe Gly Leu Ser Gly Arg Arg Ser Leu Leu Cys Gin Val 
20 25 30 

Ala Ser Thr Pro Ala His Val Gly Val Met Arg Ser Pro Val Arg Asp 
35 40 45 

Leu Ala Arg Asn Asp Gly Glu Glu Ser Thr Asp Arg Thr Pro Leu Leu 
50 55 60 

Pro Gly Ala Pro Arg Ala Glu Ala Ala Pro Val Cys Cys Ser Ala Arg 
65 70 75 80 

Tyr Asn Leu Ala lie Leu Ala Phe Phe Gly Phe Phe lie Val Tyr Ala 
85 90 95 

Leu Arg Val Asn Leu Ser Val Ala Leu Val Asp Met Val Asp Ser Asn 
100 105 110 
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Thr Thr Leu Glu Asp Asn Arg Thr Ser Lys Ala Cys Pro Glu His Ser 
115 120 125 

Ala Pro lie Lys Val i.is His Asn Gin Thr Gly Lys Lys Tyr Gin Trp 
130 135 140 

Asp Ala Glu Thr Gin Gly Trp He Leu Gly Ser Phe Phe Tyr Gly Tyr 
145 150 155 

He He Thr Gin He Pro Gly Gly Tyr Val Ala Ser Lys He Gly Gly 
165 170 175 

Lys Met Leu Leu Gly Phe Gly He Leu Gly Thr Ala Val Leu Thr Leu 
180 185 190 

Phe Thr Pro He Ala Ala Asp Leu Gly Val Gly Pro Leu He Val Leu 
135 200 205 

Arg Ala Leu Glu Gly Leu Gly Glu Gly Val Thr Phe Pro Ala Met His 
210 215 220 

Ala Met Trp Ser Ser Trp Ala Pro Pro Leu Glu Arg Ser Lys Leu Leu 
225 230 235 240 

Ser He Ser Tyr Ala Gly Ala Gin Leu Gly Thr Val He Ser Leu Pro 
245 250 255 

Leu Ser Gly He He Cys Tyr Tyr Met Asn Trp Thr Tyr Val Phe Tyr 
260 265 270 

Phe Phe Gly Thr He Gly He Phe Trp Phe Leu Leu Trp He Trp Leu 
275 280 285 

Val Ser Asp Thr Pro Gin Lys His Lys Arg He Ser His Tyr Glii Lys 
290 295 300 

Glu Tyr He Leu Ser Ser Leu Arg Asn Gin Leu Ser Ser Gin Lys Ser 
305 310 315 320 

Val Pro Trp Val Pro He Leu Lys Ser Leu Pro Leu Trp Ala He Val 
325 330 335 

Val Ala His Phe Ser Tyr Asn Trp Thr Phe Tyr Thr Leu Leu Thr I*eu 
340 345 350 

Leu Pro Thr Tyr Met Lys Glu He Leu Arg Phe Asn Val Gin Glu Asn 
355 360 365 
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Gly Phe Leu Ser Ser Leu Pro Tyr Leu Gly Ser Trp Leu Cys Met He 
370 375 380 

Leu Ser Gly Gin Ala Ala Asp Asn Leu Arg Ala Lys Trp Asn Phe Ser 

390 395 

Thr Leu Cys Val Arg Arg He Phe Ser Leu He Gly Met He Gly Pro 
405 410 

Ala Val Phe Leu Val Ala Ala Gly Phe He Gly Cys Asp Tyr Ser Leu 
420 425 430 

Ala Val Ala Phe Leu Thr He Ser Thr Thr Leu Gly Gly Phe Cys Ser 
435 440 445 

Ser Gly Phe Ser He Asn His Leu Asp He Ala Pro Ser Tyr Ala Gly 
450 455 460 

He Leu Leu Gly He Thr Asn Thr Phe Ala Thr He Pro Gly Met Val 
465 470 475 480 

Gly Pro Val lie Ala Lys Ser Leu Thr Pro Asp Asn Thr Val Gly Glu 
485 490 495 

Trp Gin Thr Val Phe Tyr He Ala Ala Ala He Asn Val Phe Gly Ala 
500 505 510 

He Phe Phe Thr Leu Phe Ala Lys Gly Glu Val Gin Asn Trp Ala Leu 
515 520 525 

Asn Asp His His Gly His Arg His 
530 535 



<210> 9 
<211> 1485 
<212> DNA 

c213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human/sheep 
consensus sequence 

<220> 

<221> CDS 

<222> (1) (1485) 
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<400> 9 

atg arg tcy ccg gtt ysr gac ycr gcc csg arc gay ggc gag gag rgc 48 

Met Xaa Xaa Pro Val Xaa Asp Xaa Ala Xaa Xaa Xaa Gly Glu Glu Xaa 
1 _ 5 10 15 

wcg gac cgc acr cck cty ctr cmg sgc gcc ccr egg gcs gaa sec get 96 
Xaa Asp Arg Xaa Xaa Xaa Xaa Xaa Xaa Ala Xaa Arg Xaa Glu Xaa Ala 
20 25 30 

cca gtr tgc tgc tct get cgt tae aac yta gea wtt ttg kcc ttt ttt 144 
Pro Xaa Cys Cys Ser Ala Arg Tyr Asn Xaa Ala Xaa Leu Xaa Phe Phe 
35 40 45 

ggt ttc ttc rtt sts tat kca tta cgk gtg aat ctg agy gtt ger yta 192 
Gly Phe Phe Xaa Xaa Tyr Xaa Leu Xaa Val Asn Leu Xaa Val Xaa Xaa 
50 55 60 

gtg gay atg gtr gat tea aay aea act kym raa gat aat aga ack tee 240 
Val Xaa Met Xaa Asp Ser Xaa Thr Thr Xaa Xaa Asp Asn Arg Xaa Ser 
65 70 75 80 

was gmg tgt sea gag cat tct get cee ata aaa gtt cwt cay aay caa 288 
Xaa Xaa Cys Xaa Glu His Ser Ala Pro lie Lys Val Xaa Xaa Xaa Gin 
85 90 95 

aeg ggt aar aag tac err tgg gat gca gaa act caa gga tgg att etc 336 
Thr Gly Xaa Lys Tyr Xaa Trp Asp Ala Glu Thr Gin Gly Trp He Leu 
100 105 110 

ggw tcy ttt tty tat ggc tac ate ate aea car att cct gga gga tat 384 
Xaa Xaa Phe Xaa Tyr Gly Tyr He He Thr Xaa He Pro Gly Gly Tyr 
115 120 125 

gtt gcc age ara akw ggg ggg aar mtg ytg eta gga tty ggg ate ytt 432 
Val Ala Ser Xaa Xaa Gly Gly Xaa Xaa Xaa Leu Gly Xaa Gly He Xaa 
130 135 140 

gsy acw get rtc ytc ace ctg ttc act ccc mty get gca gat ttm gga 480 
Xaa Xaa Ala Xaa Xaa Thr Leu Phe Thr Pro Xaa Ala Ala Asp Xaa Gly 
145 150 155 160 

gty gga sctn cty rtt gya etc agr gca eta gaa ggr eta gga gag ggt 528 
Xaa Gly Xaa Xaa Xaa Xaa Leu Xaa Ala Leu Glu Xaa Leu Gly Glu Gly 
165 170 175 

gty aea twt cca gee atg cat gcc atg tgg tct tew tgg get cee cct 576 
Xaa Thr Xaa Pro Ala Met His Ala Met Trp Ser Xaa Trp Ala Pro Pro 
180 185 190 
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etc gaa aga age aar etc cck agy acc ccr cac gca gga gca car ctt 
Leu Glu Arg Ser Xaa Leu Xaa Xaa lie Xaa Tyr Ala Gly Ala Xaa Leu 
155 200 205 



aac tgg act tac gcc etc tay tty ttt ggy ayt rtt gga acm wty tgg 
Asn Trp Thr Tyr Val Phe Xaa Xaa Phe Xaa Xaa Xaa Gly Xaa Xaa Trp 
225 230 235 240 

ttt mtt ttr tgg ate tgs tta gtt agt gay aca cca saa amw cac aag 
Phe Xaa Xaa Trp lie Xaa Leu Val Ser Xaa Thr Pro Xaa Xaa His Lys 
245 250 255 

asa aty wcy cmy tat gaa aag gar tay att ctt tea tea tta ara aat 
Xaa Xaa Xaa Xaa Tyr Glu Lys Xaa Xaa lie Leu Ser Ser Leu Xaa Asn 
260 265 270 



ttt tat act ttr ttg acm tta ttg cct act tay atg aag gar rtc eta 
Phe Tyr Thr Xaa Leu Xaa Leu Leu Pro Thr Xaa Met Lys Xaa Xaa Leu 
305 310 315 320 

agg ttc aat rtt caa gag aat ggg ttt tta tct kca kts cct tat tta 
Arg Phe Asn Xaa Gin Glu Asn Gly Phe Leu~ Ser Xaa Xaa Pro Tyr l.eu 
325 330 335 



624 



ggg aca gta rtc tec ctt cct ctt tct gga rca acc cge cac tat atg 672 
Gly Thr Val Xaa Ser Leu Pro Leu Ser Gly Xaa lie Cys Tyr Tyr Met 
210 215 220 



Ctt ata ggr atg att gga cct gcr rta ttc ctg gtw gey gcw ggm tty 
Leu He Xaa Met He Gly Pro Xaa Xaa Phe Leu Xaa Xaa Xaa Xaa Xaa 
370 375 380 



720 



768 



816 



cag cty tct tea cag aag tea gtg ccg tgg rta cey atk ytr aaa tern 864 
Gin Xaa Ser Ser Gin Lys Ser Val Pro Trp Xaa Xaa Xaa Xaa Lys Xaa 
275 ' 280 285 

Ctg cca ctt tgg get aty gtm gtt gca cay ttt tct tac aac tgg act 912 
Leu Pro Leu Trp Ala Xaa Xaa Val Ala Xaa Phe Ser Tyr Asn Trp Thr 
290 295 300 



960 



1008 



ggy tst tgg tta tgt atg ate ctg tck ggt caa get get gae aat tta 1056 

Xaa Xaa Trp Leu Cys Met He Leu Xaa Gly Gin Ala Ala Asp Asn Leu 
340 345 350 

agg gca ara tgg aat ttt tea act ytr tgk gtt cgm aga rtt ttt age 1104 

Arg Ala Xaa Trp Asn Phe Ser Thr Xaa Xaa Val Xaa Arg Xaa Phe Ser 
355 360 365 



1152 
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atw ggc cgt gat tat tcy ttg gey gtc 
Xaa Gly Cys Asp Tyr Xaa Leu Xaa Val 
385 390 

acm ctg gga ggc ttt tgc tct tct gga 
Xaa Leu Gly Gly Phe Cys Ser Ser Gly 
405 

att get cct teg tat get ggt aty etc 
He Ala Pro Ser Tyr Ala Gly Xaa Leu 
420 425 

gee act att ccw gga atg rtt ggg cec 
Ala Thr He Xaa Gly Met Xaa Gly Pro 
435 440 

cct gak aac act rtt gga gaa tgg caa 
Pro Xaa Asn Thr Xaa Gly Glu Trp Gin 
450 455 

get aty aat gtw ttt ggt gcc att ttc 
Ala Xaa Asn Xaa Phe Gly Ala He Phe 
465 470 

gaa gtr caa aac tgg gey mtc art gat 
Glu Xaa Gin Asn Trp Xaa Xaa Xaa Asp 
48S 



PCT/US99/16676 



gcw ttc eta acy ata tea aca 1200 
Xaa Phe Leu Xaa He Ser Thr 
395 400 

ttt age ate aac cat ctg gay 1248 
Phe Ser He Asn His Leu Xaa 
410 415 

ctg ggc ate aca aat acm ttt 1296 
Leu Gly He Thr Asn Xaa Phe 
430 

rtc att gey ara agt ctk ace 1344 
Xaa He Xaa Xaa Ser Xaa Thr 
445 

acy gtk ttc try aty get get 1392 
Xaa Xaa Phe Xaa Xaa Ala Ala 
460 

tty aca eta ttc gcc aaa ggt 1440 
Xaa Thr Leu Phe Ala Lys Gly 
475 480 

cac caw gga cac aga mac 1485 
His Xaa Gly His Arg Xaa 
490 495 



<210> 10 
<211> 495 
<212> PRT 

<213> Artificial Sequence 
<400> 10 

Met Xaa Xaa Pro Val Xaa Asp Xaa Ala Xaa Xaa Xaa Gly Glu Glu Xaa 
15 10 15 

Xaa Asp Arg Xaa Xaa Xaa Xaa Xaa Xaa Ala Xaa Arg Xaa Glu Xaa Ala 
20 25 30 

Pro Xaa Cys Cys Ser Ala Arg Tyr Asn Xaa Ala Xaa Leu Xaa Phe Phe 
35 40 45 

Gly Phe Phe Xaa Xaa Tyr Xaa Leu Xaa Val Asn Leu Xaa Val Xaa Xaa 
50 55 60 
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Val Xaa Mec Xaa Asp Ser Xaa Thr Thr Xaa Xaa Asp Asn Arg Xaa Ser 
^5 70 75 * 80 

Xaa Xaa Cys Xaa Glu His Ser Ala Pro He Lys Val Xaa Xaa Xaa Gin 
85 90 95 

Thr Gly Xaa Lys Tyr Xaa Trp Asp Ala Glu Thr Gin Gly Trp lie Leu 
100 105 110 

Xaa Xaa Phe Xaa Tyr Gly Tyr He He Thr Xaa He Pro Gly Gly Tyr 
H5 120 125 

Val Ala Ser Xaa Xaa Gly Gly Xaa Xaa Xaa Leu Gly Xaa Gly He Xaa 
130 135 140 

Xaa Xaa Ala Xaa Xaa Thr Leu Phe Thr Pro Xaa Ala Ala Asp Xaa Gly 
145 150 155 160 

Xaa Gly Xaa Xaa Xaa Xaa Leu Xaa Ala Leu Glu Xaa Leu Gly Glu Gly 
165 170 175 

Xaa Thr Xaa Pro Ala Met His Ala Met Trp Ser Xaa Trp Ala Pro Pro 
180 185 190 

Leu Glu Arg Ser Xaa Leu Xaa Xaa He Xaa Tyr Ala Gly Ala Xaa Leu 
195 200 205 

Gly Thr Val Xaa Ser Leu Pro Leu Ser Gly Xaa He Cys Tyr Tyr Met 
210 215 220 

Asn Trp Thr Tyr Val Phe Xaa Xaa Phe Xaa Xaa Xaa Gly Xaa Xaa Trp 
225 230 235 240 

Phe Xaa Xaa Trp He Xaa Leu Val Ser Xaa Thr Pro Xaa Xaa His Lys 

245 - 250 - 255 - 

Xaa Xaa Xaa Xaa Tyr Glu Lys Xaa Xaa He Leu Ser Ser Leu Xaa Asn 
260 265 270 

Gin Xaa Ser Ser Gin Lys Ser Val Pro Trp Xaa Xaa Xaa Xaa Lys Xaa 
275 280 285 

Leu Pro Leu Trp Ala Xaa Xaa Val Ala Xaa Phe Ser Tyr Asn Trp Thr 
290 295 300 

Phe Tyr Thr Xaa Leu Xaa Leu Leu Pro Thr Xaa Met Lys Xaa Xaa Leu 
305 310 315 320 
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Arg Phe Asn Xaa Gin Glu Asn Gly Phe Leu Ser Xaa Xaa Pro Tyr Leu 

335 

xaa xaa Trp Leu Cys Met He Leu Xaa Gly Gin Ala Ala Asp Asn Leu 
340 345 350 

Arg Ala Xaa Trp Asn Phe Ser Thr Xaa Xaa Val Xaa Arg Xaa Phe Ser 
355 360 

Leu lie xaa Met He Gly Pro Xaa Xaa Phe Leu Xaa Xaa Xaa Xaa Xaa 
370 375 380 



Xaa Gly Cys Asp Tyr Xaa Leu Xaa Val Xaa Phe Leu Xaa He Ser Thr 

400 



385 390 395 



Xaa Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Xaa 
405 43^5 

He Ala Pro Ser Tyr Ala Gly Xaa Leu Leu Gly He Thr Asn Xaa Phe 
420 425 430 

Ala Thr He Xaa Gly Met Xaa Gly Pro Xaa He Xaa Xaa Ser Xaa Thr 
435 440 445 

Pro Xaa Asn Thr Xaa Gly Glu Trp Gin Xaa Xaa Phe Xaa Xaa Ala Ala 
450 455 460 

Ala Xaa Asn Xaa Phe Gly Ala He Phe Xaa Thr Leu Phe Ala Lys Gly 
465 470 475 480 

Glu Xaa Gin Asn Trp Xaa Xaa Xaa Asp His Xaa Gly His Arg Xaa 
4B5 490 495 



<210> 11 
<211> 1485 
<212> DKA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human/sheep 
consencus sequence 

<220> 

<221> CDS 

<222> (1) . , (1485) 

<400> 11 
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acg ang ten ccg gtt nnn gac ncn gcc cng anc gan ggc gag gag ngc 43 
Met Xaa Xaa Pro Val Xaa Asp Xaa Ala Xaa Xaa Xaa Gly Glu Glu Xaa 
15 10 15 

ncg gac cgc acn ccn ctn ctn cng ngc gcc ccn egg gen gaa nec get 95 
Xaa Asp Arg Xaa Xaa Xaa Xaa Xaa Xaa Ala Xaa Arg Xaa Glu Xaa Ala 
20 25 30 

cea gtn tge tgc tet get cgt tac aac nta gea ntt ttg nec ttt ttt 144 
Pro Xaa Cys Cys Ser Ala Arg Tyr Asn Xaa Ala Xaa Leu Xaa Phe Phe 
35 40 45 

ggt tte ttc ntt ntn tat nea tta egn gtg aat ctg agn gtt gen nta 192 
Gly Phe Phe Xaa Xaa Tyr Xaa Leu Xaa Val Asn Leu Xaa Val Xaa Xaa 
50 55 60 

gtg gan atg gtn gat tea aan aca act nnn naa gat aat aga acn tee 240 
Val Xaa Met Xaa Asp Ser Xaa Thr Thr Xaa Xaa Asp Asn Arg Xaa Ser 
65 70 75 . 80 

nan gng tgt nea gag cat tet get ecc ata aaa gtt cnt can aan caa 288 
Xaa Xaa Cys Xaa Glu His Ser Ala Pro He Lys Val Xaa Xaa Xaa Gin 
85 90 95 

acg ggt aan aag tac cnn tgg gat gca gaa act caa gga tgg att etc 336 
Thr Gly Xaa Lys Tyr Xaa Trp Asp Ala Glu Thr Gin Gly Trp He Leu 
100 105 110 



384 



ggn ten ttt ttn tat ggc tac ate ate aca can att cet gga gga tat 
Xaa Xaa Phe Xaa Tyr Gly Tyr He He Thr Xaa He Pro Gly Gly Tyr 
115 120 125 

gtt gcc age ana ann ggg ggg aan ntg ntg eta gga ttn ggg ate ntt 432 
Val Ala Ser Xaa Xaa Gly Gly Xaa Xaa Xaa Leu Gly Xaa Gly He Xaa 
130 i35 ' 140 " 

gnn acn get ntc ntc ace ctg tte act cec ntn get gca gat ttn gga 480 
Xaa Xaa Ala Xaa Xaa Thr Leu Phe Thr Pro Xaa Ala Ala Asp Xaa Gly 
145 ISO 155 160 

gtn gga ncn ctn ntt gna etc agn gca eta gaa ggn eta gga gag ggt 528 
Xaa Gly Xaa Xaa Xaa Xaa Leu Xaa Ala L.eu Glu Xaa Leu Gly Glu Gly 
165 170 175 

gtn aca tnt cea gee atg cat gcc atg tgg tet ten tgg get cec cet 576 
Xaa Thr Xaa Pro Ala Met His Ala Met Trp Ser Xaa Trp Ala Pro Pro 
180 185 190 
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etc gaa aga age aan etc etn agn ate ten cat gca gga gca can ctt 624 
Leu Glu Arg Ser Xaa Leu Xaa Xaa lie Xaa Tyr Ala Gly Ala Xaa Leu 
195 200 205 

ggg aca gta ntt tct ctt cct ctt tct gga nta ate tge tac tat atg 672 
Gly Thr Val Xaa Ser Leu Pro Leu Ser Gly Xaa He Cys Tyr Tyr Met 
210 215 220 

aat tgg act tat gee ttc tan ttn ttt ggn ant ntt gga atn ntn tgg 720 
Asn Trp Thr Tyr Val Phe Xaa Xaa Phe Xaa Xaa Xaa Gly Xaa Xaa Trp 
225 230 235 240 

ttt ntt ttn tgg ate tgn tta gtt agt gan aca cca naa ann cac aag 768 
Phe Xaa Xaa Trp He Xaa Leu Val Ser Xaa Thr Pro Xaa Xaa His Lys 
245 250 255 

ana atn ncn cnn tat gaa aag gan tan att ctt tea tea tta ana aat 816 
Xaa Xaa Xaa Xaa Tyr Glu Lys Xaa Xaa He Leu Ser Ser Leu Xaa Asn 
260 265 270 

cag ctn tct tea eag aag tea gtg ccg tgg nta ccn atn ntn aaa ten 864 
Gin Xaa Ser Ser Gin Lys Ser Val Pro Trp Xaa Xaa Xaa Xaa Lys Xaa 
275 280 285 

ctg cca ctt tgg get atn gtn gtt gca can ttt tct tac aac tgg act 912 
Leu Pro Leu Trp Ala Xaa Xaa Val Ala Xaa Phe Ser Tyr Asn Trp Thr 
290 295 300 

ttt tat act ttn ttg acn tta ttg cct act taui atg aag gan ntc eta 960 
Phe Tyr Thr Xaa Leu Xaa Leu Leu Pro Thr Xaa Met Lys Xaa Xaa Leu 
305 310 315 320 

agg ttc aat ntt caa gag aat ggg ttt tta tct nea ntn cct tat tta 1008 
Arg Phe Asn Xaa Gin Glu Asn Gly Phe Leu Ser Xaa Xaa Pro Tyr Leu 
325 330 335 

ggn tnt tgg tta tgt atg ate ctg ten ggt caa get get gae aat tta 1056 
Xaa Xaa Trp Leu Cys Met He Leu Xaa Gly Gin Ala Ala Asp Asn Leu 
340 345 350 

agg gca ana tgg aat ttt tea act ntn tgn gtt cgn aga ntt ttt age 1104 
Arg Ala Xaa Trp Asn Phe Ser Thr Xaa Xaa Val Xaa Arg Xaa Phe Ser 
355 360 365 

ctt ata ggn atg att gga cct gen nta ttc ctg gtn gen gen ggn ttn 1152 
Leu He Xaa Met He Gly Pro Xaa Xaa Phe Leu Xaa Xaa Xaa Xaa Xaa 
370 375 380 
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acn ggc cgt gat tac ten ctg gen gtt gen tte^cta acn aca tea aca 1200 
Xaa Gly Cys Asp Tyr Xaa Leu Xaa Val Xaa Phe Leu Xaa He Ser Thr 

390 395 

acn czg gga ggc ttt tgc tet tct gga ttt age ate aac cat ctg gan 1248 
Xaa Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Xaa 
405 410 415 

att get cct teg tat get ggt atn etc ctg ggc ate aca aat acn ttt 1296 
He Ala Pro Ser Tyr Ala Gly Xaa Leu Leu Gly He Thr Asn Xaa Phe 
420 425 430 

gee act att cen gga atg ntt ggg ccc ntc att gen ana agt ctn acc 1344 
Ala Thr He Xaa Gly Met Xaa Gly Pro Xaa He Xaa Xaa Ser Xaa Thr 
435 440 445 

cct gan aac act ntt gga gaa tgg caa acn gtn ttc tnn atn get get 1392 
Pro Xaa Asn Thr Xaa Gly Glu Trp Gin Xaa Xaa Phe Xaa Xaa Ala Ala 
450 455 460 

get atn aat gtn ttt ggt gee att ttc ttn aca eta ttc gcc aaa ggt 1440 
Ala Xaa Asn Xaa Phe Gly Ala He Phe Xaa Thr Leu Phe Ala Lys Gly 
4€5 470 475 48O 

gaa gtn caa aac tgg gen ntc ant gat cac can gga cac aga nac 1435 
Glu Xaa Gin Asn Trp Xaa Xaa Xaa Asp His Xaa Gly His Arg Xaa 
485 490 495 



<210> 12 
<211> 495 
<212> PRT 

<213> Artificial Sequence 
<400> 12 - 

Met Xaa Xaa Pro Val Xaa Asp Xaa Ala Xaa Xaa Xaa Gly Glu Glu Xaa 
^ 5 10 15 

Xaa Asp Arg Xaa Xaa Xaa Xaa Xaa Xaa Ala Xaa Arg Xaa Glu Xaa Ala 
20 25 30 

Pro Xaa Cys Cys Ser Ala Arg Tyr Asn Xaa Ala Xaa Leu Xaa Phe Phe 
35 40 45 

Gly Phe Phe Xaa Xaa Tyr Xaa Leu Xaa Val Asn Leu Xaa Val Xaa Xaa 
50 55 60 

Val Xaa Met Xaa Asp Ser Xaa Thr Thr Xaa Xaa Asp Asn Arg Xaa Ser 
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Xaa Xaa Cys Xaa Glu His Ser Ala Pro He Lys Val Xaa Xaa Xaa Gin 
85 90 55 

Thr Gly Xaa Lys Tyr Xaa Trp Asp Ala Glu Thr Gin Gly Trp He Leu 
100 105 110 

Xaa Xaa Phe Xaa Tyr Gly Tyr He He Thr Xaa He Pro Gly Gly Tyr 
115 120 125 

Val Ala Ser Xaa Xaa Gly Gly Xaa Xaa Xaa Leu Gly Xaa Gly He Xaa 
130 135 140 

Xaa Xaa Ala Xaa Xaa Thr Leu Phe Thr Pro Xaa Ala Ala Asp Xaa Gly 
145 150 155 

Xaa Gly Xaa Xaa Xaa Xaa Leu Xaa Ala Leu Glu Xaa Leu Gly Glu Gly 
165 170 175 

Xaa Thr Xaa Pro Ala Met His Ala Met Trp Ser Xaa Trp Ala Pro Pro 
180 185 190 

Leu Glu Arg Ser Xaa Leu Xaa Xaa He Xaa Tyr Ala Gly Ala Xaa Leu 
195 200 205 

Gly Thr Val Xaa Ser Leu Pro Leu Ser Gly Xaa He Cys Tyr Tyr Met 
210 215 220 

Asn Trp Thr Tyr Val Phe Xaa Xaa Phe Xaa Xaa Xaa Gly Xaa Xaa Trp 
225 230 235 240 

Phe Xaa Xaa Trp He Xaa Leu Val Ser Xaa Thr Pro Xaa Xaa His Lys 
245 250 255 

Xaa Xaa Xaa Xaa Tyr Glu Lys Xaa Xaa He Leu Ser Ser Leu Xaa Asn 
260 265 270 

Gin Xaa Ser Ser Gin Lys Ser Val Pro Trp Xaa Xaa Xaa Xaa Lys Xaa 
275 280 285 

Leu Pro Leu Trp Ala Xaa Xaa Val Ala Xaa Phe Ser Tyr Asn Trp Thr 
290 295 300 

Phe Tyr Thr Xaa Leu Xaa Leu Leu Pro Thr Xaa Met Lys Xaa Xaa Leu 
305 310 315 320 

Arg Phe Asn Xaa Gin Glu Asn Gly Phe Leu Ser Xaa Xaa Pro Tyr Leu 
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325 



330 



335 



Xaa Xaa Trp Leu Cys Met lie Leu Xaa Gly Gin Ala Ala Asp Asn Leu 
340 345 350 

Arg Ala Xaa Trp Asn Phe Ser Thr Xaa Xaa Val Xaa Arg Xaa Phe Ser 
355 360 365 

Leu He Xaa Met He Gly Pro Xaa Xaa Phe Leu Xaa Xaa Xaa Xaa Xaa 
370 375 380 

Xaa Gly Cys Asp Tyr Xaa Leu Xaa Val Xaa Phe Leu Xaa He Ser Thr 
385 390 395 400 

Xaa Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Xaa 
405 410 415 

He Ala Pro Ser Tyr Ala Gly Xaa Leu Leu Gly He Thr Asn Xaa Phe 
420 425 430 

Ala Thr He Xaa Gly Met Xaa Gly Pro Xaa He Xaa Xaa Ser Xaa Thr 
435 440 445 

Pro Xaa Asn Thr Xaa Gly Glu Trp Gin Xaa Xaa Phe Xaa Xaa Ala Ala 
450 455 460 

Ala Xaa Asn Xaa Phe Gly Ala He Phe Xaa Thr Leu Phe Ala Lys Gly 
465 470 475 480 



Glu Xaa Gin Asn Trp Xaa Xaa Xaa Asp His Xaa Gly His Arg Xaa 
485 490 495 
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GBS TOXIN RECEPTOR 
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TNTROPUCTIQN 

Technical Field 

5 This invention provides compositions and methods relating to GBS toxin 

receptor polynucleotides and polypeptides. The invention relates to a receptor for a 
polysaccharide isolated from a bacterial source. 

Background 

Group B B-hemolytic Streptococci (GBS) are ubiquitous microorganisms. 
10 GBS is not known to cause any harmful infections in humans except for very young 
babies. GBS pneimionia, also called "early-onset disease", is associated with high 
morbidity and mortality in newborn infants. 

In a series of studies conducted by Dr. Carl G. Hellerqvist and his associates at 
the Vanderbilt University School of Medicine, Nashville, Tennessee, a polysaccharide 
1 5 GBS toxin was identified. This toxin was determined to be a major factor in the 

complications of GBS pneumonia, and was found to be useful as a therapeutic agent 
in combating timiors though inhibition of vascularization (U.S. Patent No. 5,010,062). 

In addition, as described in U.S. Patent No. 5,858,991 and W098/32453, GBS 
toxin facilitates wound healing in patients by minimizing scarring and accelerating 
20 healing, and reduces wound-related tumor progression. 

W098/32452 and W098/32448 describe the use of GBS toxin as a therapeutic 
agent for treating patients with chronic inflammatory diseases, such as rheumatoid 
arthritis and psoriasis, and for enhancing repair of neural injury. 

Prior to this invention, receptors for GBS toxin had not been identified. The 
25 inventors, believing receptors of GBS toxin to reside on cells in the developing 
vasculature of tissues undergoing angiogenesis in the conditions described above, 
embarked upon a series of experiments resulting in the present invention. 



1. 
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SUMMARY OF THE IN\/ENTTON 
For the first time, novel receptors for group B p-hemolytic Streptococcus GBS 
toxin (GBS toxin receptor) have been identified. One aspect of the invention provides 
a polypeptide comprising a GBS toxin receptor or polypeptide fragment thereof 
5 Preferred embodiments include mammalian GBS toxin receptors. Also provided is an 
antibody that recognizes GBS toxin receptor or a fragment thereof. The polypeptide 
of the invention can be used, inter alia, for the screening of compounds that can be 
used to treat or prevent conditions arising from pathologic or hypoxia-driven 
angiogenesis or neovascularization, such as, for example, cancerous tumors, chronic 
10 inflammatory disease, scarring during wound healing, keloids, neural injury, and 
reperfiision injury. 

Another aspect of the invention provides a polynucleotide encoding a GBS 
toxin receptor or a fragment thereof and a polynucleotide hybridizable to such 
polynucleotide. Preferred polynucleotides are at least 1 0 bases in length and comprise 

15 a nucleic acid sequence encoding, or are complementary to a nucleic acid sequence 
encoding, a mammalian GBS toxin receptor or a polypeptide fi-agment thereof. 

A third aspect of the invention is a complex comprising a GBS toxin bound to 
a mammalian toxin receptor or fragment thereof. Also provided is a method of 
forming such complex. The method comprises contacting a GBS toxin with a 

20 polypeptide comprising a mammalian GBS toxin receptor, or fragment thereof that 
can bind GBS toxin, under conditions that permit specific binding of the GBS toxin to 
the polypeptide, and allov^dng the complex to form. 

Yet another aspect of the invention is a method for purifying a compound that 
binds a GBS toxin receptor. The method comprises providing a polypeptide 

25 comprising a mammalian GBS toxin receptor, or firagment thereof that binds GBS 
toxin, contacting the polypeptide with a sample comprising the compound under 
conditions that allow specific binding of the compound to the polypeptide, and 
separating the boimd compound firom the remainder of the sample. 

Another aspect of the invention is a method of determining the presence or 

30 absence of GBS toxin in a sample. The method comprises contacting the sample with 
a polypeptide comprising a mammalian GBS toxin receptor, or fragment thereof that 
binds GBS toxin, under conditions that allow specific binding of GBS toxin to the 
GBS toxin receptor, and detennining whether specific binding of GBS toxin has 
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occurred. Presence of GBS toxin in a sample obtained from a neonate is indicative of 
early onset disease. 

A sixth aspect of the invention is a method for detecting pathologic 
vasculature in a mammalian tissue. The method comprises detecting the presence of a 
5 GBS toxin receptor. The method can be used for detecting or monitoring a variety of 
medical conditions associated v^ith angiogenesis or neovascularization, such as, for 
example, detecting metastasis of a cancerous tumor, or monitoring the margin of a 
tumor in a mammal undergoing a therapy for cancer. 

Another aspect of the invention provides methods for the identification of drug 

10 candidates for the treatment of medical conditions characterized by pathologic and/or 
hypoxia-driven angiogenesis or neovascularization. One embodiment is a method for 
identifying a compoimd that specifically binds a mammalian GBS toxin receptor. The 
method comprises combining a test compound with a mammalian GBS toxin receptor, 
or fragment thereof that can bind GBS toxin, under conditions that allow specific 

1 5 binding to occur, and detecting a complex formed between the test compound and the 
polypeptide. Another embodiment is a method for determining cytotoxicity of a test 
chimeric compound. The method comprises exposing a cell expressing a mammalian 
GBS toxin receptor, or fragment thereof that binds GBS toxin, to a test chimeric 
compound comprising a cytotoxic agent coupled to GBS toxin, and detecting signs of 

20 toxicity. Yet another embodiment is a method for identifying an inhibitor of a GBS 
toxin receptor by incubating test cells that express GBS toxin receptor, or a fragment 
thereof, in the presence and absence of a test compound and under conditions in 
which the cells incubated in the absence of the test compound can proliferate or 
migrate, and comparing the proliferation or migration of the test cells incubated in the 

25 presence and absence of the test compound, wherein less proliferation or migration in 
the presence of the test compoimd is indicative of the test compound being an 
inhibitor of the GBS toxin receptor. An inhibitor of endothelial cell proliferation or 
migration can be identified by the above method, wherein less proliferation or 
migration of test cells in the presence of the test compound is indicative of the test 

30 compound being an inhibitor of endothelial cell proliferation or migration. A 

therapeutic compound for the treatment or prevention of a medical condition 

characterized by pathologic angiogenesis or neovascularization can also be identified 

by the above method, wherein less proliferation or migration of test cells in the 

presence of the test compound is indicative of the test compound being a candidate 

3. 
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therapeutic compound for the treatment or prevention of the medical condition. 

The invention also provides a method for identifying a compound which 
inhibits binding of a GBS toxin to a mammalian GBS toxin receptor. The method 
coniprises simulating and selecting the most probable conformations of a mammalian 
5 GBS toxin receptor, designing a chemically modified analog that substantially mimics 
the energetically most probable three-dimensional structure of the polypeptide, 
chemically synthesizing the analog, and evaluating the bioactivity of the analog. Also 
provided is a method for identifying a compound which binds to a mammalian GBS 
toxin receptor. The method comprises simulating and selecting the most probable 

1 0 conformations of a mammalian GBS toxin receptor, deducing the most probable 
binding domains of the polypeptide, designing a compound that would form the 
energetically most probable complexes with the polypeptide, chemically synthesizing 
the compound, and evaluating the bioactivity of the compound. 

Another aspect of the invention is a method for the prevention or treatment of 

15 neonatal onset disease in a human neonate by administering an inhibitor of binding of 
GBS toxin to a human GBS toxin receptor. 

Yet another aspect of the invention is a method for inhibiting pathologic or 
hypoxia-driven endothelial cell proliferation or migration in a mammalian tissue. The 
method comprises specifically binding a molecule to a GBS toxin receptor present on 

20 the surface of at least one cell in the tissue, the molecule being selected firom the 
group consisting of a compound that can evoke an inflammatory response when 
bound to a GBS toxin receptor in a mammal, a chimeric compound comprising a 
cytotoxic compound coupled to a compound that specifically binds the GBS toxin 
receptor, an inhibitor of GBS toxin receptor phosphorylation, and an inhibitor of GBS 

25 toxin receptor activity. 

The invention also provides a GBS toxin receptor or fragment thereof, an 
inhibitor of a GBS toxin receptor, or an inhibitor of binding of a GBS toxin to a GBS 
toxin receptor, for use in a method of treatment of the human or animal body or for 
the manufacture of a medicament for the treatment of a medical condition 

30 characterized by pathologic or hypoxia-driven angiogenesis or neovascularization. 
Also provided is a chimeric compound comprising a cytotoxic agent coupled to a 
compound that binds GBS toxin receptor for use in a method of treatment of the 
human or animal body. 

Also provided are pharmaceutical compositions comprising an inhibitor of a 
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GBS toxin receptor and/or a chimeric compound comprising a cytotoxic agent 
coupled to a compound that binds GBS toxin receptor, and a pharmaceutically 
acceptable carrier. 

The invention also provides kits comprising a GBS toxin receptor or fragment 
5 and/or reagents for detecting the presence of a GBS toxin receptor or polypeptide 
fragment thereof or the presence of a polynucleotide encoding same. 

BRIEF DESCRIPTTON OF THE DRAWINGS 
FIG. 1 depicts a process of rational drug design. 

10 FIGS. 2 A and 2B depict the results of immunohistochemical analysis of GBS 

toxin receptor expression in cancerous and normal human ovary tissue, respectively, 
using antibody Pab55 as described in Example 4. 

FIGS. 3 A and 3B depict the results of immunohistochemical analysis of GBS 
toxin receptor expression in cancerous and normal human ovary tissue, respectively, 
15 using antibody Pab57 as described in Example 4, 

FIGS. 4A-4C depict the targeted delivery of a chimeric compound to GBS 
toxin receptor expressed in a cancerous tissue as described in Example 6, 

DESCRIPTION OF SPECIFIC EMBODIMENTS 

20 Definitions 

Generally, all technical and scientific terms used herein have the same 

„ „ meaning as .conMionly .understoodJ)y_one of ordinary skill in the art to which this 

invention belongs. The nomenclature used herein and the laboratory procedures in 
cell culture, molecular genetics, and nucleic acid chemistry and hybridization 
25 described below are those well knowni and commonly employed in the art. Standard 
techniques are used for recombinant nucleic acid methods, polynucleotide synthesis, 
and microbial culture and transformation (e.g., electroporation, lipofection). 
Enzymatic reactions and purification steps supplied by manufacturers are typically 
performed according to the manufacturer's specifications. The techniques and 
30 procedures are generally performed according to conventional methods in the art and 
various general references (See generally, Sambrook et al.. Molecular Cloning: A 
Laboratory Manual^ 2d ed. (1989) Cold Spring Harbor Laboratory Press, Cold Spring 

5. 
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Harbor, N.Y.) which are provided throughout this document. The nomenclature used 
herein and the laboratory procedures in analytical chemistry, organic synthetic 
chemistry, and pharmaceutical formulation described below are those well known and 
commonly employed in the art. Standard techniques can be used for chemical 
5 syntheses, chemical analyses, pharmaceutical formulation and delivery, and treatment 
of patients. As employed throughout the disclosure, the following terms, unless 
otherwise indicated, shall be understood to have the following meanings: 

By "GBS toxin receptor" is meant a proteinaceous molecule capable of 
binding a toxin from Group B P-hemolytic Streptococcus bacteria (GBS toxin), such 

1 0 as, for example, CMlOl . A GBS toxin receptor is usually found in nature on the 

surface of a cell. Recombinant membrane bound and soluble GBS toxin receptors can 
be produced by laboratory techniques known in the art and described herein. 

The term "isolated polynucleotide" referred to herein means a polynucleotide 
that has been subjected to manipulation, such that the isolated polynucleotide is no 

1 5 longer associated with the chromosome or cell that the polynucleotide is normally 
associated with in nature in the same manner as it is normally associated in nature. 
An example of an "isolated polynucleotide" is a polynucleotide of genomic, 
recombinant, or synthetic origin or some combination thereof. 

The term "isolated protein" referred to herein means a protein that is no longer 

20 associated with the cell that the protein is normally associated with in nature in the 
same manner as it is normally associated in nature, such as (1) a protein free of at 
least some other proteins from the same source, (2) a protein expressed by a cell from 
a different species, (3) a protein that does not occur in nature, and (4) a protein 
produced bom cDNA, recombinant RNA, or synthetic origin or some combination 

25 thereof. 

The tenn "polypeptide" is used herein as a generic term to refer to native 
protein, fragments, or analogs of a polypeptide sequence. Hence, native protein, 
fragments, and analogs are species of the polypeptide genus. 

The term "naturally occurring" means found in nature. For example, a 
30 polypeptide or polynucleotide sequence that is present in an organism (including 

viruses) found in nature and which has not been intentionally modified by man in the 
laboratory is naturally-occurring. 

The term "operably linked" refers to a juxtaposition wherein the components 
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SO described are in a relationship permitting them to function in their intended 
manner. A control sequence *'operably linked" to a coding sequence is ligated in such 
a way that expression of the coding sequence is achieved under conditions compatible 
with the control sequences. 
5 The term ''control sequence" refers to polynucleotide sequences which are 

necessary to effect the expression of coding sequences to which they are ligated. The 
nature of such control sequences differs depending upon the host organism; in 
prokaryotes, such control sequences generally include promoter, ribosomal binding 
site, and transcription termination sequence; in eukaryotes, generally, such control 
1 0 sequences include promoters and transcription termination sequence. The term 
"control sequences" is intended to include, at a minimum, all components whose 
presence is necessary for expression, and can also include additional components 
whose presence is advantageous, for example, leader sequences and fusion partner 
sequences. 

15 The term "polynucleotide" as referred to herein means a polymeric form of 

nucleotides of at least 10 bases in length, either ribonucleotides or 
deoxyribonucleotides or a modified form of either type of nucleotide. The term 
includes single- and double-stranded forms of DNA. 

The term ^^oligonucleotide" referred to herein includes naturally occurring, and 

20 modified nucleotides linked together by naturally occurring and non-naturally 

occurring oligonucleotide linkages. An oligonucleotide is usually a polynucleotide 
200 bases or fewer in length. Preferably oligonucleotides are minimally 10 to 60 
bases in length and most preferably 

15-35 bases in minimal length. Oligonucleotides are usually single-stranded, e.g. for 
25 probes; although oligonucleotides may be double-stranded, e.g. for use in the 

construction of a gene mutant. Oligonucleotides of the invention can be either sense 
or antisense oligonucleotides. The term "naturally occurring nucleotides" referred to 
herein includes deoxyribonucleotides and ribonucleotides. The term "modified 
nucleotides" referred to herein includes nucleotides with modified or substituted sugar 
30 groups and the like. The term "oligonucleotide linkages" referred to herein includes 
oligonucleotides linkages such as phosphorothioate, phosphorodithioate, 
phosphoroselenoate, phosphorodiselenoate, phosphoroanilothioate, phoshoraniladate, 
phosphoroamidaie, and the like. An oligonucleotide can include a label for detection, 
if desired. 

. 7. 

BNSDOCID: <WO 0005375A1_IA> 



wo 00/05375 PCT/US99/16676 

By "complementary'' or "complement" is meant that wherever adenine 
appears in a first nucleic acid sequence, thymine or uracil is found in the 
''complementary" sequence and vice versa, and wherever guanine appears in a first 
nucleic acid sequence, cytosine is found in the ''complementary'' sequence and vice 
5 versa. 

The term "sequence identity" describes the proportion of base matches 
between two nucleic acid sequences or the proportion of amino acid matches between 
two amino acid sequences, i.e. the degree of identity between two sequences. When 
sequence identity is expressed as a percentage, e.g., 50%, the percentage denotes the 

10 proportion of exact matches over the length of sequence from a GBS toxin receptor 
sequence that is compared to some other sequence. Various computer alignment 
programs can be used to determine sequence identity. In its simplest form, % identity 
is calculated by dividing the number of exact matches between two nucleic acid 
sequences or between two amino acid sequences by the total number of nucleotides or 

15 amino acids in the reference sequence. For example, if there are 300 matches 
between sequences 400 amino acids in length, the sequences have 75% identity. 
Uracil and thymine are considered identical v/hen comparing a ribonucleic acid 
sequence with a deoxyribonucleic acid sequence. 

As applied to polynucleotides, the term "substantial identity" means that two 

20 nucleic acid sequences when optimally aligned, such as by the program BLAST 
(Altschul et al., 1 Mol Biol 215:403-410 (1990)), share at least about 85%, 
preferably at least about 90% sequence identity and most preferably 95% or greater 
sequence identity. When using computer alignment programs, gaps (in either of the 
two sequences) are permitted to maximize matching; gap lengths of 15 bases or less 

25 are usually used; 6 bases or less are preferred; 2 bases or less are most preferred. 
When using oligonucleotides as probes or in treatments, the sequence identity 
between the target nucleic acid and the oligonucleotide sequence is generally not less 
than 17 target base matches out of 20 possible oligonucleotide base pair matches 
(85%); preferably not less than 9 matches out of 10 possible base pair matches (90%), 

30 and most preferably not less than 1 9 matches out of 20 possible base pair matches 
(95%). 
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Preferably, bases which are not identical nevertheless are part of a degenerate 
codon that encodes the same amino acid at that amino acid position. Alternatively, 
bases which are not identical preferably are part of a degenerate codon that encodes a 
conservative amino acid substitution for that amino acid position. 
5 As applied to polypeptides, the term "substantial identity" means that two 

peptide sequences, when optimally aligned by the BLAST computer program, share at 
least about 80 percent sequence identity, preferably at least about 86 percent sequence 
identity, more preferably at least about 95 percent sequence identity, even more 
preferably at least about 99 percent sequence identity up to having one amino acid 

10 difference, and most preferably share 100% identity. Gaps (in either of the two 

sequences being matched) are allowed in maximizing matching; gap lengths of 5 or 
less are preferred with 2 or less being more preferred. Preferably, residue positions 
which are not identical differ by conservative amino acid substitutions. Conservative 
amino acid substitutions refer to the interchangeability of residues having similar side 

1 5 chains. For example, a group of amino acids having aliphatic side chains is glycine, 
alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic- 
hydroxyl side chains is serine and threonine; a group of amino acids having amide- 
containing side chains is asparagine and glutamine; a group of amino acids having 
aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino 

20 acids having basic side chains is lysine, arginine, and histidine; and a group of amino 
acids having sulfur-containing side chains is cysteine and methionine. Preferred 
conservative amino acid substitution groups are: valine-leucine-isoleucine, 
phenylalanine-tyrosine, lysine-arginine, alanine-valine, glutamic-aspartic, and 
asparagine-glutamine, 

- 25 The term "hybridizable under high stringency conditions" referred to herein 

means capable of specific binding under conditions whereby only nucleic acid 
sequences having a substantial identity of greater than 95% with respect to each other 
will hybridize. These conditions are known in the art and discussed herein. 

The term "degenerate codon" means any of the nucleotide codon triplets 
30 encoding a desired amino acid according to the genetic code. Codons can be selected 
based upon known preferred codon usage in a host organism such as E, colt 

The term "polypeptide fragment" as used herein refers to a polypeptide that 
has an amino-terminal and/or carboxy-terminal deletion, but where the remaining 
amino acid sequence is identical to the corresponding positions in the naturally- 
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occurring sequence deduced, for example, from a full-length DNA sequence. 
Fragments typically are at least 3 amino acids long, preferably are 5-10 amino acids 
long, more preferably are 10-50 amino acids long, even more preferably are more 
than 50 amino acids long and comprise at least one extracellular domain of a GBS 
5 toxin receptor. Most preferred are fragments that comprise the entire extracellular 
domains of a GBS toxin receptor, and preferably also comprise portions of 
transmembrane and intracellular domains sufficient to maintain the polypeptide 
fragment in a functional stereochemical conformation on the surface of a cell, lipid 
membrane, liposome, micelle, or other lipophilic structure. 

10 The term "immunologically reactive" means having antigenic properties or 

being capable of being specifically bound by an antibody that can specifically bind 
GBS toxin receptor. A substance has antigenic properties if it can generate 
monoclonal or polyclonal antibodies when administered to an animal under conditions 
known in the art to facilitate the production of antibodies that will recognize and bind 

15 a particular antigen. 

A "heterologous polypeptide" is a polypeptide different from polypeptides 
normally produced by a particular cell. For example, a GBS toxin receptor 
polypeptide or fragment thereof that is produced recombinantly in a cell that does not 
normally produce such GBS toxin receptor polypeptide or fragment thereof, is a 

20 heterologous polypeptide, A second polypeptide joined to a GBS toxin receptor 

polypeptide or fragment thereof is also a heterologous polypeptide if it is not joined to 
a GBS toxin receptor polypeptide in nature. 

As used herein, the terms "label" or "labeled" refers to incorporation of a 
detectable marker, e.g., by incorporation of a radiolabeled amino acid or attachment to 

25 a polypeptide of biotinyl moieties that can be detected by marked avidin (e.g., 

streptavidin containing a fluorescent marker or enzymatic activity that can be detected 
by optical or colorimetric methods). Various methods of labeling polypeptides and 
glycoproteins are known in the art and may be used. Examples of labels for 
polypeptides include, but are not limited to, the following: radioisotopes (e.g., ^H, "*C, 

30 ^^S, '^'l), fluorescent labels (e.g., FITC, rhodamine, lanthanide phosphors), 
enzymatic labels (e.g., horseradish peroxidase, 6-galactosidase, luciferase, alkaline 
phosphatase), chemiluminescent, biotinyl groups, predetermined polypeptide epitopes 
recognized by a secondary reporter (e.g., leucine zipper pair sequences, binding sites 
for secondary antibodies, metal binding domains, epitope tags). In some 
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embodiments, labels are attached by spacer arms of various lengths to reduce 
potential steric hindrance. 

The term ''compound'' as used herein preferably refers to a peptidic, 
pep'tidomimetic, organic, or other chemical molecule and also refers to a nucleic acid 
5 molecule or chemical derivative thereof. The compound can interact with, or be, the 
polynucleotides or polypeptides of the invention. 

The singular forms "a", "an" and "the" include plural referents unless the 
context clearly dictates otherwise. 

The SEQ ID NOs of the nucleic acid and amino acid sequences described 
1 0 herein are summarized below in Table 1 . 



Table 1 

Nucleic Acid and Amino Acid Sequences 



SEQ ID NO: 


Type of Sequence 


Description 


SEQ ID NO: 1 


nucleic acid 


Partial human GBS toxin receptor (HPS 5) 


SEQ ID NO: 2 


amino acid 


Partial human GBS toxin receptor (HP55) 


SEQ ID NO: 3 


nucleic acid 


Sheep GBS toxin receptor (SP55) 


SEQ ID NO: 4 


amino acid 


Sheep GBS toxin receptor (SP55) 


SEQ ID NO: 5 


nucleic acid 


Primer 


SEQ ID NO: 6 


nucleic acid 


Primer 


SEQ ID NO: 7 


nucleic acid 


Full-length human GBS toxin receptor (HP59) 


SEQ ID NO: 8 


amino acid 


Full-length human GBS toxin receptor (HP59) 


SEQ ID NO: 9 


nucleic acid 


Human/Sheep consensus GBS toxin receptor 
coding region 

(with base codes a, c, g, t, m, r, w, s, y, k) 


SEQ ID NO: 10 


amino acid 


Human/Sheep consensus GBS toxin receptor 
coding region (translation of SEQ ID No: 9) 


SEQ ID NO: 11 


nucleic acid 


Human/Sheep consensus GBS toxin receptor 
coding region 

(with base codes a, c, g, t, n) 


SEQ ID NO: 12 


amino acid 


Human/sheep consensus GBS toxin receptor 
coding region (translation of SEQ ID NO: 1 1) 



The headings provided herein describe the general topic discussed and are not 
1 5 intended to be exclusive of information discussed in other sections. Frequently, 
information, methods, compositions, and other aspects may be applicable to more 
than one embodiment of the invention and can be so combined. 



Introduction 

20 GBS toxin binds to tissues undergoing pathologic, hypoxia-driven, and 
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embryologic angiogenesis or neovascularization. The inventors have identified at 
least wo mammalian GBS toxin receptors, which are described herein. Examples 1 
and 2 describe the cloning and characterization of some GBS toxin receptors. The 
inventors have classified GBS toxin receptor as an integral protein with seven 
5 transmembrane domains. The predicted segments are shown in Table 7. The protein 
has several putative sites for phosphorylation by cAMP-dependent kinase, protein 
kinase C (PKC), and casein kinase II (CK2). Typically, such integral proteins, upon 
binding of a molecule (e.g., a ligand or an extracellular messenger), undergo a 
conformational change which facilitates phosphorylation at phosphorylation sites such 

10 as those discussed above. The phosphorylation of the protein at these sites may 
trigger a signal transduction cascade, which often results in proliferation or other 
nuclear responses of the cells which have been exposed to the binding molecule. 
Angiogenesis or neovascularization involves proliferation and migration of 
endothelial cells. As discussed in greater detail in Examples 4 and 5, GBS toxin 

15 receptor expression is correlated with medical conditions involving pathologic, 
hypoxia-driven, and embryogenic angiogenesis or neovascularization. GBS toxin 
receptor polypeptides can be used for a variety of purposes, including screening for 
compounds that can inhibit endotheUal cell proliferation and/or migration mediated by 
GBS toxin receptor and screening for cytotoxic chimeric compounds that can bind to 

20 and destroy cells expressing GBS toxin receptor. GBS toxin receptor polynucleotides 
can be used for a variety of purposes, including the design of antisense 
polynucleotides that can block translation of messenger RNA encoding GBS toxin 
receptor. 

25 Polynucleotides 

One aspect of the invention provides for isolated polynucleotides at least ten 
bases in length encoding or complementary to a nucleic acid sequence encoding a 
GBS toxin receptor or a fragment derived therefrom. Preferably, the GBS toxin 
receptor is a manmiahan GBS toxin receptor, more preferably an ovine, bovine or 

30 feline GBS toxin receptor, and most preferably a human GBS toxin receptor. The 
isolated polynucleotides can be naturally occurring or non-naturally occurring. The 
isolated polynucleotides can comprise a DNA sequence or an RNA sequence in which 
every T is replaced with U. For purposes of determining percentage identity, T is 
considered equivalent to U. Preferably, the polynucleotides include alleles of an 
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ovine, bovine, feline or human GBS toxin receptor, and can include alleles of GBS 
toxin receptor of other mammals. These polynucleotides can be isolated using 
polynucleotides derived from SEQ ID NOs: 1, 3, 7, 9 and 1 1, as described further 
belcyw. 

5 Polynucleotides, oligonucleotides and fragments of the invention selectively 

hybridize to nucleic acid strands under hybridization and wash conditions that 
minimize appreciable amounts of detectable binding to nonspecific nucleic acids. The 
polynucleotides can be hybridizable under high stringency conditions to a nucleic acid 
molecule having a nucleic acid sequence comprising at least 20 contiguous 

10 polynucleotides, preferably at least 30 contiguous nucleotides of SEQ ID NO: 1 or 
SEQ ID NO: 3, and even more prefereably to the nucleic acid sequence of SEQ ID 
NO: 1, 3, 7, 9 or 1 1 or the complement of SEQ ID NO: 1, 3, 7, 9 or 1 1. Such 
polynucleotides can be used for performing selective, high stringency hybridization 
and are particularly useful for performing amplification of nucleic acid by polymerase 

1 5 chain reaction (PGR) to determine the presence or absence of GBS toxin receptor in a 
sample, for isolating a naturally occurring nucleic acid encoding a GBS toxin receptor 
(see Example 3), as antisense molecules for blocking translation of GBS toxin 
receptor mRNA. Particularly preferred are polynucleotides hybridizable under high 
stringency conditions to a nucleic acid molecule having a nucleic acid sequence 

20 comprising the nucleic acid sequence of nucleotides 266 to 1 870 of SEQ ID NO: 7 
(the putative fiill length coding region of a human GBS toxin receptor, excluding the 
start codon), nucleotides 266 to 1870 of SEQ ID N0:7 (the putative full length coding 
region of a human GBS toxin receptor, including the start codon), nucleotides 61 to 
1542 of SEQ ID NO:l (the partial coding region of a human GBS toxin receptor, 

25 excluding the start codon), nucleotides 58 to 1542 of SEQ ID NO: 1 (the partial 

coding region of a human GBS toxin receptor, including the start codon), nucleotides 
87 to 1568 of SEQ ID NO: 3 (the coding region of a sheep GBS toxin receptor, 
excluding the start codon), nucleotides 84 to 1568 of SEQ ID NO: 3 (the coding region 
of a sheep GBS toxin receptor, including the start codon), or a complementary nucleic 

30 acid sequence thereof 

The polynucleotides can have an identity to the nucleic acid sequence of a 
corresponding region of SEQ ID NO: 1 , 3 or 7 or the complement of a corresponding 
region of SEQ ID NO: 1, 3 or 7 in the range of about 85% to 100%, preferably greater 
than about 87% identity, more preferably greater than about 95% identity, and most 
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preferably about 99% to 100% identity. Particularly preferred are polynucleotides 
comprising the nucleic acid sequence of nucleotides 266 to 1870 of SEQ ID NO: 7, or 
nucleotides 87 to 1568 of SEQ ID NO: 3, SEQ ID NO: 9, SEQ ID NO:l 1, or a 
complementary nucleic acid sequence thereof. 
5 Preferably, the polynucleotides comprise a nucleic acid sequence encoding, or 

complementary to a nucleic acid sequence encoding, a polypeptide having an identity 
to the amino acid sequence of a firagment of a GBS toxin receptor in the range of 
about 85% to 100%, more preferably greater than 86% identity, even more preferably 
greater than 95% identity, and most preferably 99% to 100% identity. Preferably, the 

10 fragment binds GBS toxin. Preferred fragments comprise all or a portion of residues 
1 to 495 of SEQ ID NO: 2 or all or a portion of residues 1 to 536 of SEQ ID NO: 8. 
Particularly preferred are polynucleotides comprising a nucleic acid sequence 
encoding a polypeptide having 100% identity to the amino acid sequence of residues 
1 to 495 of SEQ ID NO: 4, residues 1 to 495 of SEQ ID NO: 2, or residues 1 to 536 of 

15 SEQIDNO:8. 

Polynucleotides encoding naturally occurring GBS toxin receptor can be 
isolated from various tissue sources and cell cultures from different species that 
produce such a receptor by the methods described herein, such as, for example, cells 
from tumor endothelium, synovial tissue in rheumatoid arthritis, or hypoxic tissue 

20 deprived of or restricted from blood flow, such asin reperfiision injury or wounded 
tissue. Such polynucleotides can be isolated by hybridization using probes or by 
polymerase chain reaction using oligonucleotides, as well as by implementing other 
molecular biology techniques known in the art. Such probes and oligonucleotides 
typically comprise various regions of the sequence of SEQ ID NO: 1, 3, 7, 9 or 1 1, 

25 preferably of SEQ ID NO: 1, 3, or 7, or encode various regions of the sequence of 
SEQ ID NO. 2, 4, 8,10 or 12, preferably of SEQ NO: 2, 4 or 8. 

Polynucleotides useful for cloning genes encoding GBS toxin receptors of 
various organisms can be determined by comparing the amino acid sequences of 
homologous proteins, (see Table 4). For example, conserved regions can be targeted 

30 for the synthesis of oligonucleotides or degenerate oligonucleotides to be used as 

probes for hybridization or nucleic acid amplification, techniques discussed further 

below and in Example 3. Stringency can be varied to achieve selective hybridization 

conditions whereby nucleic acid sequences having less than 95% identity with respect 

to each other will hybridize. These conditions are known in the art and discussed 
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herein and examples are provided. Generally, the nucleic acid sequence identity 
between the polynucleotides, oligonucleotides, and fragments of the invention and a 
nucleic acid sequence of interest will be at least about 85%, and more typically with 
preferably increasing identities of at least about 90%, 95%, 99%, and 100%. 
5 Polynucleotides can be used as probes under high stringency wash conditions 

and with corresponding hybridization conditions, as known in the art. Small 
polynucleotides, for example, polynucleotides 200 bases or fewer in length, are often 
referred to in the art as oligonucleotides. Techniques for using polynucleotides as 
probes to detect the same or related nucleic acid sequences is well known in the art. 

10 See, for example, Sambrook et al, especially Chapter 11, the text of which is herein 
incorporated by reference. Usually, probes can be made from polynucleotides that are 
10 to 200 bases in length. Preferably probes are made from polynucleotides 10 to 60 
nucleotides in length and most preferably 12 to 40 bases in length. Specific probes 
can be designed based on results obtained using nucleic acid homology computer 

15 programs such as FASTA, which uses the method of Pearson and Lipman {Proc. Natl. 
Acad. ScL USA 85:2444-2448 (1988)) and shows the degree of identity between 
compared sequences. The size of the probe is dependent upon the region of the gene 
to which it will be hybridized. The size of the probe increases as the degree of 
homology to undesirable nucleic acid sequences increases. A probe 10-50 nucleotides 

20 in length can be used, preferably more than 50 nucleotides, even more preferably 
more than 100 nucleotides, and most preferably a probe made from the entire coding 
region of a GBS toxin receptor will be used. To decrease the number of false 
positives, preferably two probes are used to identify clones that bind to both probes 
under hybridization and wash conditions. Oligonucleotides can be synthesized on an 

25 Applied BioSystems oligonucleotide synthesizer according to specifications provided 
by the manufacturer. 

Typically, hybridization and washing conditions are performed at according to 
conventional hybridization procedures. Typical hybridization conditions for 
screening plaque lifts (Benton and Davis (1978) Science 196: 180) can be: 50% 

30 formamide, 5 x SSC (sodium chloride, sodium citrate) or SSPE (sodium chloride, 
sodium phosphate, EDTA), 1-5 x Denhardt's solution, 0.1-1% SDS, 100-200 fdg 
sheared heterologous DNA or tRNA, 0-10% dextran sulfate, 1 x 10^ to 1 x 10^ 
cpm/ml of denatured probe with a specific activity of about 1x10^ cpm///g, and 
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incubation at 42°C for about 6-36 hours. Prehybridizaiion conditions are essentially 
identical except that probe is not included and incubation time is typically reduced. 
Washing conditions are typically 1-3 x SSC, 0.1-1% SDS, 42-70°C with change of 
wash solution at about 5-30 minutes. Cognate bacterial sequences, including allelic 
5 sequences, can be obtained in this manner. For high stringency hybridization 
conditions, various parameters can be altered to increase the stringency of 
hybridization, such as by increasing the temperature of incubation with the labeled 
probe. Preferably, for greater flexibility in experimental design, the probe can be 
hybridized at a lower temperature, such as, for example, room temperature and the 

1 0 stringency can then be modified by altering the salt concentration and temperature of 
the wash solutions. For high stringency a wash temperature of greater than or equal to 
42''C can be used, such as, for example, 68°C, in a wash buffer having a salt 
concentration less than 3X SSC, such as, for example, O.IX SSC. In some cases, 
TMACL can also be used, particularly for polynucleotides rich in G-C base pairs in 

15 order to decrease non-specific binding. A lower stringency wash can be used to 

hybridize polynucleotides with lower identities or polynucleotides that are less than 
60 base pairs in length. For a low stringency wash, temperatures of less than or equal 
to 42° can be used in a wash buffer having a salt concentration of greater than or 
equal to 2X SSC. 

20 The invention includes methods for amplification of target nucleic acids, such 

as the polymerase chain reaction ("PCR") technique. The PCR technique can be 
applied to identify related sequences in the genomes of various organisms and to 
detect nucleotide sequences in suspected samples, using oligonucleotide primers 
spaced apart from each other and based on the genetic sequence set forth herein. The 

25 primers are complementary to opposite strands of a double-stranded DNA molecule 
and are typically separated by from about 50 to 450 nucleotides or more (usually not 
more than 2000 nucleotides). This method entails preparing the specific 
oligonucleotide primers followed by repeated cycles of target DNA denaturation, 
primer binding, and extension with a DNA polymerase to obtain DNA fragments of 

30 the expected length based on the primer spacing. Extension products generated from 
one primer serve as additional target sequences for the other primer. The degree of 
amplification of a target sequence is controlled by the number of cycles that are 
performed and is theoretically calculated by the simple formula 2n where n is the 
number of cycles. Given that the average efficiency per cycle ranges from about 65% 
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to 85%, 25 cycles produce from 0.3 to 4.8 million copies of the target sequence. The 
PCR method is described in a number of publications, including Saiki et al.. Science 
(1985) 230:1350-1354; Saiki et aL, Nature (1986) 324:163-166; and Scharf et al.. 
Science (1986) 233:1076-1078. Also see U.S. Patent Nos. 4,683,194; 4,683,195; and 
5 4,683,202, the text of each patent is herein incorporated by reference. Additional 
methods for PCR amplification are described in: PCR Technology: Principles and 
Applications for DNA Amplification ed. HA Eriich, Freeman Press, New York, NY 
(1992); PCR Protocols: A Guide to Methods and Applications, eds. Innis, Gelfland, 
Snisky, and White, Academic Press, San Diego, CA (1990); Mattila et al. (1991) 

10 Nucleic Acids Res, 19: 4967; Eckert, K.A. and Kunkel, T.A. (1991 ) PCR Methods and 
Applications 7:17, and; PCR, eds. McPherson, Quirkes, and Taylor, IRL Press, 
Oxford, all of which are incorporated herein by reference. 

In yet another embodiment, an antisense polynucleotide can be administered to 
a mammal to treat or prevent a medical condition involving pathologic and/or 

1 5 hypoxia-driven angiogenesis. The antisense oligonucleotides of the invention can be 
synthesized by any of the known chemical oligonucleotide synthesis methods. Such 
methods are generally described, for example, in Winnacker, From Genes to Clones: 
Introduction to Gene Technology. VCH Veriagsgesellschaft mbH (H., Ibelgaufts 
trans. 1987). Any of the known methods of oligonucleotide synthesis can be utilized 

20 in preparing the instant antisense oligonucleotides. The antisense oligonucleotides are 
most advantageously prepared by utilizing any of the commercially available, 
automated nucleic acid synthesizers. The device utilized to prepare the 
oligonucleotides described herein, the Applied Biosystems 3 SOB DNA Synthesizer, 
utilizes -cyanoethyl phosphoramidite chemistry. Antisense oligonucleotides 

25 hybridizable with any portion of the mRNA transcript can be prepared by the 

oligonucleotide synthesis methods knovm to those skilled in the art. While any length 
oligonucleotide can be utilized in the practice of the invention, sequences shorter than 
12 bases may be less specific in hybridizing to the target GBS toxin receptor mRNA, 
and may be more easily destroyed by enzymatic digestion. Hence, oligonucleotides 

30 having 12 or more nucleotides are preferred. Sequences longer than 1 8 to 21 
nucleotides may be somewhat less effective in inhibiting GBS toxin receptor 
translation because of decreased uptake by the target cell. Thus, oligomers of 1 2-2 1 
nucleotides are most preferred in the practice of the present invention, particularly 
oligomers of 12-18 nucleotides. Oligonucleotides complementary to and hybridizable 
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with any portion of the GBS toxin receptor mRNA transcript are, in principle, 
effective for inhibiting translation of the transcript, and capable of inducing the effects 
herein described. Translation is most effectively inhibited by blocking the mRNA at a 
site at or near the initiation codon. Thus, oligonucleotides complementary to the 5' 
5 region of the GBS toxin receptor mRNA transcript are preferred. Secondary or 
tertiary structure which might interfere with hybridization is minimal in this region. 
Moreover, sequences that are too distant in the 3' direction from the initiation site can 
be less effective in hybridizing the mRNA transcripts because of a "read-through" 
phenomenon whereby the ribosome is postulated to unravel the antisense/sense 

10 duplex to permit translation of the message, (see, e.g. Shakin, J. Biochemistry 261, 
1601 8 (1986)). The antisense oligonucleotide is preferably directed to a site at or near 
the ATG initiation codon for protein synthesis. Oligonucleotides complementary to a 
portion of the GBS toxin receptor mRNA including the initiation codon are preferred. 
While antisense oligomers complementary to the 5' region of the GBS toxin receptor 

1 5 transcript are preferred, particularly the region including the initiation codon, it should 
be appreciated that useful antisense oligomers are not limited to those complementary 
to the sequences found in the translated portion of the mRNA transcript, but also 
includes oligomers complementary to nucleotide sequences contained in, or extending 
into, the 5' and 3* untranslated regions. Antisense nucleotides or antisense expression 

20 constructs can find use to treat or prevent diseases associated with pathologic or 

hypoxia-driven angiogenesis and neovascularization, as inappropriate expression of 
GBS toxin receptor results in hyperproliferation of endothelial cells. 

In one embodiment, the polynucleotides of the invention can exist in linear 
form. In another embodiment, the polynucleotides can exist in circular form as part of 

25 aplasmid. 

In yet another embodiment, a probe or PGR primer comprises a group of 

polynucleotide species containing different degenerate codons at various positions, 

which polynucleotides encode, or are complementary to sequences encoding, a GBS 

toxin receptor in whole or in part. Such polynucleotides can be useful for isolating 

30 nucleic acid sequences encoding polypeptides having at least about 85% identity to 

the amino acid sequence of sheep or human GBS toxin receptor, such as, for example, 

GBS toxin receptors of other organisms. Typically, such polynucleotides are 

synthesized chemically as described above by programming a synthesizer to 

incorporate a particular combination of nucleic acid residues at a certain position. 
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Typical designations are shown in Table 2. 

Table 2 
Base Codes 



Svmbol 


Meaning 


A 


A; adenine 


C 


C; cytosine 


G 


G; guanine 


T 


T; thymine 


U 


U; uracil 


M 


AorC 


R 


Aor G 


W 


AorT/U 


S 


CorG 


Y 


CorT/U 


K 


GorT/U 


V 


A or CorG; not T/U 


H 


A or C or TAJ; not G 


D 


A or G or T/U; not C 


B 


C or G or T/U; not A 


N 


A or C or G or T/U 
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Polypeptides 

Another aspect of the invention provides polypeptides comprising (1) the full 
length GBS toxin receptor protein or a naturally occurring allelic variant thereof, (2) 
fragments of at least 3 amino acids of the amino acid sequence of SEQ ID NO: 2, 4, 8, 
5 1 0 or 1 2, and (3) a GBS toxin receptor protein, polypeptide, or polypeptide fragment 
having an amino acid identity in the range of about 80% to 100% to the amino acid 
sequence of a corresponding region of SEQ ID NO: 2, 4 or 8. Preferred fragments of 
the amino acid sequence of SEQ ID NO: 2, 4, 8, 10 or 12, are at least 5, 6, 7, 8 or 9 
amino acids in length and are immunologically reactive, i.e., immunogenic. More 

10 preferred are fragments at least 25 amino acids in length and fragments comprising 
the amino acid sequence of residues 181 to 419 of SEQ ID NO: 2 or residues 1 to 240 
of SEQ ID NO: 4. Most preferred are fragments that can bind GBS toxin. Preferably, 
the GBS toxin receptor protein, polypeptide, or polypeptide fragment has an amino 
acid identity to the amino acid sequence of a corresponding region of SEQ ID NO: 2, 

15 4 or 8 of at least about 86%, more preferably at least about 95% identity, even more 
preferably at least about 99% identity up to having one amino acid difference, and 
most preferably 100% identity. Preferred polypeptides have at least about 89% 
identity, more preferably at least about 95% identity, even more preferably at least 
about 99% identity up to having one amino acid difference, and most preferably 100% 

20 identity to the amino acid sequence of residues 181 to 419 of SEQ ID NO: 2, residues 
1 to 495 of SEQ ID NO: 4. Preferably, a full length GBS toxin receptor protein 
comprises the amino acid sequence of residues 1 to 495 of SEQ ID NO: 2, residues 1 
to 495 of SEQ ID NO: 4, or residues 1 to 536 of SEQ ID NO: 8, or an allelic variant 
thereof The polypeptides of the invention can include amino acids in addition to the 

25 GBS toxin receptor protein, polypeptide, or polypeptide fragment. Such polypeptides 
typically comprise a heterologous polypeptide joined to a second polypeptide derived, 
as described above, from a GBS toxin receptor. Preferably the additional amino acids 
are covalently linked to the amino- terminal or carboxy-terminal terminus of the GBS 
toxin receptor protein, polypeptide, or polypeptide fragment. 

30 Fragments or analogs of GBS toxin receptor can be prepared by those of 

ordinary skill in the art. Preferred amino- and carboxy-termini of fragments or 
analogs occur near boundaries of fiinctional domains. For example, such functional 
domains include domains conferring the property of induction of an inflammatory 
response upon binding of GBS toxin to the GBS toxin receptor. GBS toxin mediates 
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the binding and opsonization by C3 of endothelial cells that express the GBS toxin 
receptor. Such domains can comprise the binding site for GBS toxin, in whole or in 
part, or domains otherwise essential for GBS toxin receptor structure and/or function. 
Preferably, computerized comparison methods are used to identify sequence motifs or 
predicted protein conformation domains that occur in other proteins of known 
structure and/or function. Methods to identify protein sequences that fold into a 
known three-dimensional structure are known (Bowie et al. (1991) Science 253: 164). 
Computerized prediction methods, such as, for example, a hydropathy profile as 
provided by the "Soap" program in PC/GENE can be employed to identify putative 
structural and functional domains. Using the method of Klein, Kanehisa and DeLise, 
Biochim Biophys Acta (1985) 815:468-476, the inventors have classified a sheep 
GBS toxin receptor, SP55, as an integral protein with seven transmembrane segments 
predicted. Such a protein is also known colloquially in the art as a "7-spanner". The 
predicted segments are set forth below in Table 3. 



15 



Table 3 

Predicted Transmembrane Domains of SPSS 





Inner Boundaries 


Outer Boundaries 


Segment 
Sequence 


P:I odds* 


No. 


From 


To 


From 


To 


1 


232 


248 


226 


252 


FFGIVGIIWFILWICLV 
(232-248 ofSEQIDNo. 4) 


2.589323E-05 


2 


369 


385 


365 


389 


LIGMIGPAIFLVAAGFI 
(369-385 ofSEQIDNo. 4) 


1.0073 llE-03 


3 


458 


474 


456 


479 


TVFCIAAAINVFGAIFF 
(458-474 ofSEQIDNo. 4) 


2.482542E-03 


4 


137 


153 


135 


157 


LLLGFGIFATAIFTLFT 
(137-153 OfSEQIDNo. 4) 


7.564906E-03 


5 


42 


58 


42 


58 


LAFLSFFGFFVLYSLRV 
(42-58 OfSEQIDNo. 4) 


8.236557E-02 


6 


328 


344 


328 


345 


GFLSAVPYLGCWLCMI 
L (328-344 ofSEQIDNo. 
4) 


.1925022 


7 


390 


406 


390 


407 


SLAVAFLTISTTLGGFC 
(390-406 of SEQ ID No. 4) 


.8064944 



sequence. An integral sequence with a higher hydrophobicity number is more likely 
to be part of a transmembrane domain. 

20 A computerized alignment of the amino acid sequences of GBS toxin receptor 

in various organisms provides further guidance in preparing preferred fragments. See, 
for example, Table 4 which compares the amino acid sequence of residues 42 to 536 

21. 
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of a human GBS toxin receptor (HP59) (residues 42 to 536 of SEQ ID NO: 8) and a 
sheep GBS toxin receptor (SP55). 



Table 4 

Alignment of Human and Sheep GBS Toxin Receptor Amino Acid 
5 Sequences 

SP55 MKSPVSDLAPSDGEEGSDRTPLLQRAPRAEPAPVCCSARYNLAFLSFFGF 50 

I ill III MM II MM Mill Mill II Mill I Mil 

HP55 MRSPVRDLARITOGEESTDRTPLLPGAPRAEAAPVCCSARYNLAILAFFGF 50 

S P5 5 FVLYSLRVNLSVALVDMVDSNTTAKDNRTS YECAEHSAPI KVLHNQTGKK 100 

I I IIIIIIIIIIIIMIIII Mill I llllllll Mlllll 

HPS 5 FI VYALRVNLSVALVDMVDSNTTLEDNRTSKACPEHSAPI KVHHNQTGKK 100 

SP55 YRWDAETQGWILGSFFYGYIITQIPGGYVASRSGGKLLLGFGIFATAIFT 150 

I MMMMMMMMMMMMIMM III llllll II I 

HP55 YQWDAETQGWILGSFFYGYIITQIPGGYVASKIGGKMLLGFGILGTAVLT 150 

SP5 5 LFTPLAADFGVGALVALRALEGLGEGVTYPAMHAMWSSWAPPLERSKLLS 2 00 

MM III III I llllllllllll IIIIIIIIIIIMIIIIIIII 

HP55 LFTPIAADLGVGPLIVLRALEGLGEGVTFPAMHAMWSSWAPPLERSKLLS 200 

SP5 5 I S YAGAQLGTWSLPLSGVI CYYMNWTYVFYFFGI VGII WFI LWI CLVSD 250 

lllllllllll IIIMI MMIMMMMM II II Mi MM 

HPS 5 I SYAGAQLGTVISLPLSGIICYYMNWTYVFYFFGTIGIFWFLLWI WLVSD 250 

SP55 TPETHKTITPYEKEYILSSLKNQLSSQKSVPWIPMLKSLPLWAIWAHFS 300 

II III IIIMMIM Mllllllilll IIIMMIIilllll 

HP55 TPQKHKRISHYEKEYILSSLRNQLSSQKSVPWVPILKSLPLWAIWAHFS 3 00 

SPSS YNWTFYTLLTLLPTYMKEVLRFNIQENGFLSAVPYLGCWLCMILSGQAAD 3 50 

MMMMMMMMM MM MIMM MM IIIM lllllll 

HP55 YNWTFYTLLTLLPTYMKEILRFNVQENGFLSSLPYLGSWLCMILSGQAAD 350 

SPSS NLRARWNFSTLWVRRVFSLIGMIGPAI FLVAAGFI GCDYSLAVAFLTI ST 4 00 

MM IIIMI Ml IMIIIMM MM MM II Mil Ml MUM 

HPS 5 NLRAKWNFSTLCVRRIFSLIGMIGPAVFLVAAGFIGCDYSLAVAFLTI ST 400 

SPSS TLGGFCSSGFSINHLDIAPSYAGILLGITNTFATIPGMIGPIIARSLTPE 450 

IIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIII II M MM 

HP55 TLGGFGSSGFSINHLDIAPSYAGILLGITNTFATIPGMVGPVIAKSLTPD 450 

SPSS NT I GEWQTVFCI AAAINVFGAI FFTLFAKGEVQNWAI SDHQGHRN 4 95 

II lllllll lillMIIIIIIIIIIIIIIIIIII II ill 

HPS 5 NTVGEWQTVFYIAAAINVFGAI FFTLFAKGEVQNWALNDHHGHRH 4 95 

HP55-SEQIDNO:2 
SP55-SEQIDNO:4 

Thus, the foregoing examples demonstrate that those of skill in the art can recognize 
sequence motifs and structural conformations that may be used to define structural 
1 0 and functional domains in a GBS toxin receptor sequence. 

22. . 
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Although one class of preferred embodiments are fragments having amino- 
and/or carboxy-termini corresponding to amino acid positions near functional 
domains borders, alternative fragments may be prepared. The choice of the amino- 
andncarboxy-termini of such fragments rests with the discretion of the practitioner and 
5 will be made based on experimental considerations, such as ease of construction, 
stability to proteolysis, thermal stability, immunological reactivity, amino- or 
carboxyl-terminal residue modification, or other considerations. Polypeptide 
fragments usually contain at least nine amino acids and can contain any number of 
amino acids provided that the peptide fragment is at least about 80% identical to the 

10 corresponding fragment of SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID N0:8. The 

human GBS toxin receptor has 41 additional amino acids on the N-terminus compared 
to the sheep GBS toxin receptor (compare SEQ ID NO:4 and SEQ ID N0:8). 
Analogs can comprise additions or deletions of some or all of those 41 N-terminal 
amino acids. N-terminal and C-terminal additions useful, e.g., for purification and/or 

1 5 antibody recognition are also contemplated. Examples include histidine tags, a FLAG 
(phenylalanine, leucine, alanine, guanine) epitope, fusion partners such as glutathione 
S transferase, chloramphenicol acetyltransferase (CAT), luciferase, P-galactosidase, 
and the like. Deletions of unconserved amino acids are also contemplated, provided 
that the structural integrity and/or binding properties of the GBS toxin receptor are not 

20 substantially compromised. 

Analogs can also comprise amino acid substitutions, preferably conservative 
substitutions. Also preferred are conservative and/or non-conservative substitutions 
in regions having less shared identity among various species. For example, a variant 
of a GBS toxin receptor can comprise conservative and/or non-conservative 

25 substitutions of amino acids corresponding to residues 2, 6, 10, 1 1, 16, 17, 24, 31, 44, 
46, 52, 53, 55, 74, 75, 81, 82, 84, 93, 102, 132, 133, 137, 144, 145, 148, 149, 155, 
159, 163, 165, 166, 179, 212, 219, 235, 236, 239, 242, 246, 253, 254, 257, 259, 260, 
271, 283, 285, 319, 324, 332, 333, 338, 355, 362, 366, 377, 439, 442, 445, 450, 453, 
461 , 487, 488, 491 and 495 of SEQ ID N0:4. Preferably the substitution is an amino 

30 acid present in the corresponding position of SEQ ID NO:4 or SEQ ID N0:8. For 
example, referring to the alignment plot in Table 4, the amino acid corresponding to 
position 152 of SEQ ID N0:4 can be arginine (R), glutamine (Q), or a conservative or 
non-conservative substitution of R or Q, and preferably is R or Q. Such regions can 

23. 
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be identified by amino acid sequence alignment plots, such as that shown in Table 4. 
Preferred amino acid substitutions are those which: (1) reduce susceptibility to 
proteolysis, (2) reduce susceptibility to oxidation, (3) alter binding affinity for GBS 
toxin, and (4) confer or modify other physicochemical or functional properties of such 
5 analogs. Analogs can include various mutations of a sequence other than the 

naturally-occurring peptide sequence, such as, for example, single or multiple amino 
acid substitutions. 

A conservative amino acid substitution should generally not substantially 
change the structural characteristics of the parent sequence (e.g., a replacement amino 

10 acid should not tend to break a helix that occurs in the parent sequence, disrupt 
disulfide bonds or disrupt other types of secondary structure that characterizes the 
parent sequence). Examples of art-recognized polypeptide secondary and tertiary 
structures are described in Proteins, Structures and Molecular Principles, (1984) 
Creighton (ed.), W.H. Freeman and Company, New York; Introduction to Protein 

15 Structure, (1991), C. Branden and J. Tooze, Garland Publishing, New York, NY; and 
Thornton et al. {\99\) Nature 354: 105 (which are incorporated herein by reference). 
A conservative substitution is a "replacement of an amino acid in a polypeptide by 
one with similar characteristics." (McGraw-Hill Dictionary of Scientific and 
Technical Terms, Fifth Edition, 1994, Sybil P. Parker, Editor in Chief). The structure 

20 and characteristics of naturally occurring amino acids has long been known in the art 
(Biochemistry, Second Edition, Albert L. Lehninger, 1975, pages 71-76) For 
example, amino acids which are similar by virtue of their hydrophobic R groups are 
alanine, valine, leucine, isoleucine, proline, phenylalanine, tryptophan, and 
methionine. Alanine, valine, leucine, and isoleucine are similar by virtue of their 

25 aliphatic R groups. Phenylalanine and tryptophan are similar by virtue of their 
aromatic R groups. Glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine are similar by virtue of their uncharged polar R groups. Glycine and 
alanine are similar by virtue of their small size. Serine and threonine are similar by 
virtue of a hydroxyl in their R group. Asparagine and glutamine differ by only one 

30 methyl group. Similarly, aspartic acid and glutamic acid differ by only one methyl 

group, and they are similar by virtue of their acidic R groups. Lysine, arginine, and 

histidine are similar by virtue of their basic R groups. In addition, lysine and arginine 

are similar by virtue of the amino groups on the end of the aliphatic chain in their R 

groups. Tyrosine and phenylalanine are similar by virtue of their aromatic groups. 

24. 
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Amino substitutions commonly made in the art include a substitution of valine for 
leucine or isoleucine, alanine for glycine, serine for threonine, asparagine for 
glutamine, aspartic acid for glutamic acid, and lysine for arginine, tyrosine for 
phenylalanine, and vice versa. 
5 Typically, one skilled in the art would generally refrain from changing amino 

acids that are conserved among the various GBS toxin receptors, but a conservative 
substitution might reasonably be made. For example, Table 4 guides one skilled in 
the art to avoid substitutions, particularly nonconservative substitutions, for amino 
acids corresponding to residues 1, 3-5, 7-9, 12-15, 18-23, 26-30, 32-43, 45, 47-51, 54, 

10 56-73, 76-80, 83, 85-92, 94-101, 103-131, 134-136, 138-143, 146-147, 150-154, 156- 
158, 160-162, 164, 167-178, 180-211, 213-218, 220-234, 237-238, 240-241, 243-245, 
247-252, 255-256, 258, 261-270, 272-282, 284, 286-318, 320-323, 325-331, 334-337, 
339-354, 356-361, 363-365, 367-376, 378-438, 440-441, 443-444, 446-449, 451-452, 
454-460, 462-486, 489-490 and 492-494 of SEQ ID N0:4, which are conserved 

15 among the GBS toxin receptors shown in Table 4. 

Tables 5 and 6 describe sequences within HP59 and SP55, respectively, that 
match predicted amidation, N-glycosylation, cAMP-phosphorylation, CK2- 
phosphosylation, myristylation (addition of unsaturated fatty acid molecules), and 
PKC-phosphosylation sites (Omega 1 . 1 sequence analysis program). The information 

20 contained in these tables provides guidance to one skilled in the art for designing GBS 
toxin receptor variants and fragments. When designing polypeptide variants, for 
example, one may decide to avoid substitutions in some or all of these regions. When 
designing polypeptide fragments other than immunogenic polypeptide fragments, for 
example, one may opt to include some or all of these regions. 



Tables 




Table 6 




Putative Recognition Sites 




Putative Recognition Sites 




in HP59 




in SPSS 






Seq. ID 






Seq. ID 






NO: 8 






NO: 4 




Site 


Residues: 


Sequence 


Site 


Residues: 


Sequence 


AMIDATION 


23-26 


SGRR 


AMIDATION 


97-100 


TGKK 


AMIDATION 


13S-i4t 


TGKK 


ASN GLYCOSYLATION 


59-62 


NLSV 


ASN GLYCOSYLATION 


100-103 


NLSV 


ASN GLYCOSYLATION 


71-74 


NTTA 


ASN GLYCOSYLATION 


112-115 


NTTL 


ASN GLYCOSYLATION 


77-80 


NRTS 


ASN GLYCOSYLATION 


118-121 


NRTS 


ASN GLYCOSYLATION 


95-98 


NQTG 


ASN GLYCOSYLATION 


136-139 


NQTG 


ASN GLYCOSYLATION 


225-228 


NWTY 


ASN GLYCOSYLATION 


266-269 


NWTY 


ASN GLYCOSYLATION 


302-305 


NWTF 


ASN GLYCOSYLATION 


343-346 


NWTF 


ASN GLYCOSYLATION 


357-360 


NFST 


ASN GLYCOSYLATION 


398-401 


NFST 


CK2 PHOSPHO SrTE 


11-14 


SDGE 


CAMP PHOSPHO SFTE 


297-300 


ICRIS 


CK2 PHOSPHO SITE 


73-76 


TAKD 


CK2 PHOSPHO SITE 


113-116 


TTLE 


CK2 PHOSPHO SITE 


79-82 


TSYE 
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Table 5 



Table 6 



Putative Recognition Sites 




Putative Recognition Sites 




in HP59 




in SPSS 








Seq. ID 






Seq. ID 






NO: 8 






NO: 4 




Site 


Residues: 


Sequence 


Site 


Residues: 


Sequence 


CK2 PHOSPHO SITE 


114-117 . 


TLED 


CK2 PHOSPHO SITE 


259-262 


TPYE 


CK2 PHOSPHO SITE 


300-303 


SHYE 


CK2_PHOSPH0_SITE 


452-455 


TIGE 


CK2 PHOSPHO SITE 


493-496 


TVGE 


MYRISTYL 


126-13! 


GGYVAS 


MYRISTYL 


66-71 


GAPRAE 


MYRISTYL 


142-147 


GIFATA 


MYRISTYL 


167-172 


GGYVAS 


MYRISTYL 


162-167 


GALVAL 


MYRISTYL 


183-188 


GILGTA 


MYRISTYL 


172-177 


GLGEGV 


MYRJSTYL 


213-218 


GLGEGV 


MYRISTYL 


205-210 


GAQLGT 


MYRISTYL 


246-25 1 


GAQLGT 


MYRISTYL 


209-214 


GTVVSL 


MYRISTYL 


250-255 


GTVISL 


MYRISTYL 


337-342 


GCWLCM 


MYRISTYL 


378-383 


GSWLCM 


MYRISTYL 


386-391 


GCDYSL 


MYRISTYL 


427-432 


GCDYSL 


MYRISTYL 


403-408 


GGFCSS 


MYRISTYL 


444-449 


GGFCSS 


MYRISTYL 


423-428 


GILLGI 


MYRISTYL 


464-469 


GILLGI 


iVi YKlb 1 YL 


427-432 


GITNTF 


MYRISTYL 


468-473 


GITNTF 


PKC PHOSPHO SITE 


17-19 




PKC PHOSPHO SITE 


23-25 


SGR 


PKC PHOSPHO SITE 


37-39 


SAR 


PKC PHOSPHO SITE 


58-60 


TDR 


PKC PHOSPHO SITE 


55-57 


SLR 


PKC PHOSPHO SITE 


78-80 


SAR 


PKC PHOSPHO SITE 


73-75 


TAK 


PKC PHOSPHO SITE 


120-122 


TSK 


PKC PHOSPHO SITE 


97-99 


TGK 


PKC PHOSPHO SITE 


138-140 


TGK 


PKC PHOSPHO SITE 


254-256 


THK 


PKC PHOSPHO SITE 


310-312 


SLR 


PKC PHOSPHO SITE 


269-271 


SLK 


PKC PHOSPHO SITE 


317-320 


SQK 


PKC PHOSPHO SITE 


276-278 


SQK 



In light of the foregoing, preferred polypeptides comprise an amino acid 
sequence of the formula: 

AAl-AAn-AAm 

5 wherein: 

AAl is absent or is M; 

AAn is a contiguous chain of 0 to 1 00 amino acids, preferably of 0 or 41 
amino acids, even more preferably of residues 2-42 of SEQ ID NO:8; and 

AAm is a contiguous chain of 494 amino acids comprising AA43 through 
10 AA536, wherein: 

(1) each of AA43, AA47, AA51, AA52, AA57, AA58, AA65, 
AA66, AA72, AA85, AA87, AA93, AA94, AA96, AA115, AAl 16, 
AA122, AA123, AA125, AA134, AA143, AA173, AA174, AA178, 
AA185, AAl 86, AA189, AA190, AA196, AA200, AA204, AA206, 
1 5 AA207, AA220, AA253, AA260, AA276, AA277, AA280, AA283, 

AA287, AA294, AA295, AA298, AA300, AA301, AA312, AA324, 
AA326, AA360, AA365, AA373, AA374, AA379, AA396, AA403, 
AA407, AA418, AA480, AA483, AA486, AA491, AA494, AA502, 
AA528, AA529, AA532 and AA536 is an essential amino acid or a 
20 modified amino acid and preferably is an amino acid residue 

corresponding to: 

(a) residue 43, 47, 5 1 , 52, 57, 58, 65, 66, 72, 85, 87, 93, 
94, 96, 115, 116, 122, 123, 125, 134, 143, 173, 174, 178, 185, 
186, 189, 190, 196, 200, 204, 206, 207, 220, 253, 260, 276, 
25 277, 280, 283, 287, 294, 295, 298, 300. 301, 312, 324, 326, 

360, 365, 373, 374, 379, 396, 403, 407, 418, 480, 483, 486, 



26. 



BNSOOCID: <W0 0C»S375A1 JA> 



wo 00/05375 



PCT/US99/16676 



491, 494, 502, 528, 529, 532 and 536, respectively, of SEQ ID 
N0:8; 

(b) residue 2, 6, 10, 11. 16, 17, 24, 25, 31. 44, 46, 52, 
53, 55, 74, 75, 81, 82, 84. 93, 102, 132, 133, 137, 144, 145, 

5 _ 148,149,155,159,163. 165,166,179,212,219.235,236. 

239, 242, 246, 253, 254, 257, 259. 260, 271, 283. 285. 319! 
324, 332, 333, 338, 355. 362, 366, 377, 439, 442, 445, 450, 
453, 461, 487, 488. 491 and 495, respectively of SEQ ID N0:4; 
or 

10 (c) a conservative substitution thereof; 

(2) each of AA44-AA46, AA48-AA50, AA53-AA56, AA59- 
AA64, AA67-AA71, AA73-AA84, AA86, AA88-AA92, AA95, 
AA97-AA114. AA117-AA121. AA124, AA126-AA133, AA135- 
AA142, AA144-AA172, AA175-AA177, AA179-AA184. AA187- 

15 AA188, AA191-AA195, AA197-AA199, AA201-AA203, AA205, 

AA208-AA219, AA221-AA252, AA254-AA259, AA261-AA275, 
AA278-AA279, AA281-AA282, AA284-AA286, AA288-AA293, 
AA296-AA297, AA299, AA302-AA3i 1, AA313-AA323, AA325, 
AA327-AA359, AA361-AA364, AA366-AA372. AA375-AA378. 

20 AA380-AA395, AA397-AA402. AA404-AA406, AA408-AA41 7. 

AA419-AA478, AA481-AA482, AA484-AA485, AA487-AA490, 
AA492-AA493, AA495-AA501, AA503-AA527, AA530-AA531 and 
AA533-AA535 is 

(a) residue 44-46, 48-50, 53-56, 59-64, 67-71, 73-84, 
25 86, 88-92, 95, 97-114, 117-121, 124, 126-133, 135-142, 144- 

172, 175-177, 179-184, 187-188, 191-195. 197-199. 201-203. 
205. 208-219. 221-252, 254-259, 261-275, 278-279, 281-282. 
284-286, 288-293, 296-297, 299, 302-31 1, 313-323, 325. 327- 
359, 361-364, 366-372, 375-378, 380-395, 397-402, 404-406, 
30 408-417, 419-478, 481-482, 484-485, 487-490, 492-493, 495- 

501, 503-527, 530-531 and 533-535, respectively, of SEQ ID 
N0:8; or 

(b) a conservative substitutions thereof; and 
(3) AA3 1 5 through AA367 are optionally absent. 

35 Preferred polypeptides comprise the amino acid sequence of SEQ ID N0:4, 

SEQ ID NO:8 or an amino acid sequence which varies from that sequence only at the 
specific residues which are not conserved between the sheep GBS toxin receptor 
(SEQ ID N0:4) and the human GBS toxin receptor (SEQ ID NO: 8). Of those 
variations, the most preferred variations are those resulting in a polypeptide encoded 

40 by SEQ ID NO: 1 1 . Even more preferred variations are those amino acids in the 
corresponding positions of the amino acid sequence of SEQ ID N0:4. Particularly 
preferred are polypeptides comprising an amino acid sequence that differs from SEQ 
ID NO:2, SEQ ID N0:4 or SEQ ID N0:8 at no more than about 20% of the amino 
acid residues, with increasing preference for no more than about 10%, 5%, 1%, with 

45 one to zero amino acid differences being most preferred. 
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Besides targeting specific amino acids for change, analogs of GBS toxin 
receptor can also be prepared by techniques involving activity selection, such as, for 
example, phage display, directed evolution, DNA shuffling, and homologous in 
procaryotes or eucaryotes of genes from different species, as described in part in U.S. 
5 Patent Nos. 5,605,793; 5,830,721; 5,811,238; 5,837,458; 5,093,257; 5,223,409; 
5,403,484; 5,571,698; and 5,837,500, which are incorporated herein by reference. 

Any variant or fragment of the human and sheep GBS toxin receptors 
described herein can be tested for the requisite activity by determining whether the 
variant or fragment can bind GBS toxin. 
10 These polypeptides provide reagents useful in drug discovery and purification 

and can be used in various ifi vitro assays, preferably when expressed on the surface 
of a cell, e.g., a stable transfected cell. For example, assays such as binding assays 
can be used to screen test compounds, including polysaccharides and other 
compoimds, for their ability to bind the GBS toxin receptor. Assays can identify 
1 5 potential drug candidates that block GBS toxin binding to the GBS toxin receptor. 
Such drugs are useful for preventing and/or treating early onset disease in neonatal 
humans. Some polypeptides can be used to competitively inhibit binding GBS toxin 
to a GBS toxin receptor. 
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The polypeptides of the invention can be used to affinity purify GBS toxin, a 
GBS toxin chimeric compound, and other polysaccharides or compounds which can 
bind the GBS toxin receptor. 

The polypeptides can also be used to develop a method of targeting a 
5 cytotoxic agent for delivery to a cell that expresses a GBS toxin receptor. For 
example, a cytotoxic agent can be coupled to a molecule that binds a GBS toxin 
receptor for selective delivery to the neovasculature of a growing tumor. Such a 
delivery system would permit a highly concentrated, localized attack on a growing 
tumor, while minimizing the adverse systemic side effects encountered with most 
10 chemotherapeutics. In one instance, the cytotoxic agent can be GBS toxin, which, 
upon binding to GBS toxin receptor, induces an inflammatory response as described 
in Hellerqvist et al., Angiogenesis: Molecular Biologv. Clinical Aspects. Edited by 
M.E. Maragoudakis et al.. Plenum Press, New York 1994, pp. 265-269. In a similar 
manner, selective delivery of a therapeutic agent to a cell that expresses a GBS toxin 
15 receptor could be used advantageously to treat tumors, rhexmiatoid arthritis or neural 
injury, or to facilitate wound healing. 

The polypeptides of the invention can also be used to screen for and/or design 
a GBS toxin mimetic with improved therapeutic properties, such as, for example, 
improved ability to inhibit hypoxia-induced neovascularization or angiogenesis. Such 
20 mimetics are useful in the treatment and prevention of conditions resulting from 
hypoxia-induced neovascularization or angiogenesis, such as, for example, tximor 
growth, scarring during wound healing, gliosis during repair of neural injury, 
reperfiision injury, restenosis, rheumatoid arthritis, psoriasis, other chronic 
inflammatory diseases characterized by angiogenesis, etc. Therapeutic properties can 
25 be improved by enhancing biological stability, affinity for the GBS toxin receptor, 
complement binding activity, reducing antigenicity, etc. 

The polypeptides of the invention can also be used to generate antibodies for 
various therapeutic and research purposes. The polypeptides of the invention can be 
used to immunize rabbits, mice, goats, chickens, or other animals known in the art to 
30 be amenable to such inmiunization. Monoclonal antibodies are generally preferred 
but polyclonal antibodies can also be used, provided that detection of binding of the 
GBS toxin receptor antibody to the GBS toxin receptor is possible. The production of 
non-human monoclonal antibodies, e.g., murine, is well known (see, e.g., Harlow et 

al. Antibodies A Laboratory Manual, Cold Spring Harbor Press, pp. 139-240, 1989, 
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incorporated herein by reference). As it may be difficult to generate human 
monoclonal antibodies to a human receptor or binding domain polypeptide, it may be 
desirable to transfer antigen binding regions of non-human monoclonal antibodies, 
e.g.-the F(ab')2 or hypervariable regions or murine monoclonal antibodies, to human 
5 constant regions (Fc) or frameiwork regions by recombinant DNA techniques to 

produce substantially human molecules. Such methods are generally known and are 
described in, e.g., U.S. Pat. Nos. 4,816,397 and 4,946,778, and EP publications 
173,494 and 239,400. Alternatively, one may isolate DNA sequences which code for 
a human monoclonal antibody or portions thereof that specifically bind to the receptor 

10 protein by screening a DNA library from human B cells according to the general 
protocol outlined in WO 90/14430, and then cloning and amplifying the sequences 
which encode the antibody (or binding firagment) of the desired specificity. 

Usually, polypeptides used for producing antibodies are the full-length 
receptor or receptor fragments designed fi-om putative extracellular domains identified 

15 by a variety of methods known in the art, including computer programs which predict 
secondary and tertiary structure of a polypeptide based upon its primary amino acid 
sequence. Another method for designing antigenic peptides utilizes computer 
programs that predict the high points of hydrophilicity within a particular primary 
amino acid sequence. For example, using the method of Happ and Woods, Proc, 

20 NatL Acad, ScL USA (1981) 78:3824-3829, via the "Antigen" program in PC/GENE, 
the inventors identified 3 regions of high hydrophilicity, shown below in Table 7, and 
used the results to design antigenic peptides to be used in the preparation of antibodies 
against GBS toxin receptor (see Example 4). 

Table? 

25 High Points of Hydrophilicity in SPSS 



No. 


Ah 


Sequence 


1 


2.05 


Glu-Glu-Gly-Ser-Asp-Arg (14-19 of SEQ ID No. 2) 


2 


1.52 


Lys-Asp-Asn-Arg-Thr-Ser (75-80 of SEQ ID No. 2) 


3 


1.33 


Arg-Ala-Pro-Arg-Ala-Glu (25-30 of SEQ ID No. 2) 



Ah = Average hydrophilicity. 

Antibodies that recognize various portions of the intact GBS toxin receptor 
can be used to further investigate structure and function of the receptor. The 
polypeptides of the invention can give rise to antibodies that recognize a variety of 
30 forms of GBS toxin receptor, including, but not limited to, intact GBS toxin receptor 
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expressed on a cell surface, denatured GBS toxin receptor or non-denatured GBS 
toxin receptor, and GBS toxin receptor purified away from cellular components or 
GBS toxin receptor contained in a cell lysate. GBS toxin receptor antibodies can be 
usecl to study species differences as well as GBS toxin receptor expression levels in 
5 various cell types. 
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Antibodies thai recognize a portion or all of an extracellular domain are 
particularly useful as a diagnostic for the monitoring of tumor growth and metastasis, 
for the detection or identification of a chronic inflammatory condition, such as, for 
example, rheumatoid arthritis or psoriasis, and for the detection of other medical 
5 conditions arising due to hypoxia-driven angiogenesis, such as, for example, 
restenosis. Typically, such antibodies can be employed in a variety of standard 
research and diagnostic techniques, including, but not limited to, western blot, 
immunoprecipitation, ELISA, radioimmunoassay (RIA), BIACOR®, enzyme-linked- 
immimoassay (EIA), immunofluorescence, fluorescence activated cell sorting 
10 (FACS), and in vivo diagnostic imaging systems such as magnetic resonance imaging 
(MRI), nuclear magnetic resonance (NMR), computerized axial tomography (CAT) 
scan, and position emission tomography (PET), etc. 

In addition, antibodies that block the binding of GBS toxin to a GBS toxin 
receptor can be used for the treatment or prevention of early onset disease in a 
1 5 neonatal human. Such antibodies can directly or indirectly block the GBS toxin 
binding site on the GBS toxin receptor. 

In one embodiment, the GBS toxin receptor protein is naturally occurring and 
can be isolated from a cell extract by protein purification techniques known in the art, 
such as, for example, ion exchange column chromatography, high performance liquid 
20 chromatography (HPLC), reversed phase HPLC, or affinity chromatography using 
antibodies that recognize the GBS toxin receptor. 

Alternatively, the isolated proteins and polypeptides are expressed using 
polynucleotides encoding the polypeptide(s) of the invention in operative association 
with an appropriate control sequence including a promoter in an expression vector 
25 suitable for expression, preferably in a mammahan cell, and also in bacterial, insect, 
or yeast cells. 

Usually, the GBS toxin receptor polynucleotide or a fragment thereof can be 
expressed in a mammalian system. Such expression will usually depend on a 
mammalian promoter, which is any DNA sequence capable of binding mammalian 
30 RNA polymerase and initiating the downstream (3') transcription of a coding 
sequence (e.g. structural gene) into mRNA. Usually, a promoter will have a 
transcription initiation region which is usually placed proximal to the 5' end of the 
coding sequence. This transcription initiation region typically includes an RNA 

polymerase binding site and a transcription initiation site. 
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Vectors suitable for replication in mammalian cells are known in the art, and 
can include viral replicons, or sequences that ensure integration of the sequence 
encoding PAK65 into the host genome. Suitable vectors can include, for example, 
those derived from simian virus SV40, retroviruses, bovine papilloma virus, vaccinia 
5 virus, and adenovirus. 

A suitable vector, for example, is one derived from vaccinia viruses. In this 
case, the heterologous DNA is inserted into the vaccinia genome. Techniques for the 
insertion of foreign DNA into the vaccinia virus genome are known in the art, and 
utilize, for example, homologous recombination. The insertion of the heterologous 
10 DNA is generally into a gene which is non-essential in nanire, for example, the 

thymidine kinase gene (tk), which also provides a selectable marker. Plasmid shuttle 
vectors that greatly facilitate the construction of recombinant viruses have been 
described (see, for example, Mackett et al. (1984); Chakrabarti et al. (1985); Moss 
(1987)). Expression of the heterologous polypeptide then occurs in cells or 
1 5 individuals which are immunized with the live recombinant vaccinia virus. 

Such suitable manunalian expression vectors usually contain one or more 
eukaryotic transcription units that are capable of facilitating expression in mammalian 
cells. The transcription unit is comprised of at least a promoter element to mediate 
transcription of foreign DNA sequences. Suitable promoters for manmialian cells are 
20 known in the art and include viral promoters such as those from simian virus 40 

(SV40) (Subramani et al., Mol Cell. Biol. 1:854^864, 1981), cytomegalovirus (CMV) 
(Boshart et al.. Cell 41:521-530, 1985), Rous sarcoma virus (RSV), adenovirus 
(ADV) (Kaufinan and Sharp, Mol. Cell. Biol. 2:1304-1319, 1982), and bovine 
papilloma virus (BPV), as well as cellular promoters, such as a mouse 
25 metallothionein-1 promoter (U.S. Patent No. 4,579,821), a mouse VK promoter 

(Bergman et al., Proc. Natl. Acad. Sci. USA 81 :7041-7045, 1993; Grant et al., Nuc. 
Acids Res. 15:5496, 1987), and amouse VH promoter (Loh et al.. Cell 33:85-93, 
1983). 

The optional presence of an enhancer element (enhancer), combined with the 
30 promoter elements described herein, will typically increase expression levels. An 

enhancer is any regulatory DNA sequence that can stimulate transcription up to 1000- 
fold when linked to endogenous or heterologous promoters, with synthesis beginning 
at the normal mRNA start site. Enhancers are also active when they are placed 
upstream or downstream from the transcription initiation site, in either normal or 
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flipped orientation, or at a distance of more than 1000 nucleotides from the promoter 
(Maniatis el al. (1987) Science 236:1237; Alberts et al. (1989) Molecular Biology of 
the Cell, 2nd ed.). Enhancer elements derived from viruses can be particularly useful, 
because they typically have a broader host range. Examples useful in mammalian 
5 cells include the SV40 early gene enhancer (Dijkema et al (1985) EMBO J. 4:761) 
and the enhancer/promoters derived from the long terminal repeat (LTR) of the Rous 
Sarcoma Virus (Gorman et al. (1982b) Proc. Natl. Acad. Sci. 79:6777), from human 
cytomegalovirus (Boshart et al. (1985) Cell 41 :521) as well as the mouse |i enhancer 
(Gillies, Cell 33:717-728, 1983). Additionally, some enhancers are regulatable and 

10 become active only in the presence of an inducer, such as a hormone or metal ion 
(Sassone-Corsi and Borelh (1986) Trends Genet. 2:215; Maniatis et al. (1987) 
Science 236:1237). 

In addition, the transcription unit can also be comprised of a termination 
sequence and a polyadenylation signal which are operably linked to the GBS toxin 

15 receptor coding sequence. Polyadenylation signals include, but are not limited to, the 
early or late polyadenylation signals from SV40 (Kaufinan and Sharp), the 
polyadenylation signal from the Adenovirus 5 ElB region and the human growth 
hormone gene terminator (DeNoto et al., Nuc. Acids Res, 9:3719-3730, 1981). 

Sequences that cause amplification of the gene may also be desirable, as are 

20 sequences which encode selectable markers. Selectable markers for mammalian cells 
are known in the art, and include, for example, thymidine kinase, dihydrofolate 
reductase (together with methotrexate as a DHFR amplifier), aminoglycoside 
phosphotransferase, hygromycin B phosphotransferase, asparagine synthetase, 
adenosine deanwnase, and antibiotic resistant genes such as neomycin. 

25 A GBS toxin receptor, or fragment thereof, can be expressed on the surface of 

a cell, or can be expressed in soluble or secreted form. Expression on the surface of 
the cell can be achieved, for example, by including a secretory leader operably linked 
to a nucleic acid sequence encoding the desired receptor fragment and at least one 
transmembrane domain. The secretory leader can be that encoded by the GBS toxin 

30 receptor gene, or can be a heterologous leader sequence commonly used in the art, 
such as, for example, the leader sequence of Schizosaccharomyces pombe phol+ acid 
phosphatase (Braspenning et al., Biochem Biophys Res. Commim (1998) 245:166- 
71), the leader sequence of human interleukin-2 (IL-2) gene (Sasada et al.. Cell Struct 
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Funct (1988) 13:129-141). Expression in solul?le or secreted form can be achieved, 
for example, by excluding from the gene construct nucleic acid sequences encoding a 
transmembrane domain. In some instances, solubility and/or secretion are achieved 
by 'the use of a fusion partner, such as, for example, chloramphenicol acetyltransferase 
5 (CAT), P-galactosidase, and other genes readily expressed in the selected host cell. 

The vector that encodes GBS toxin receptor can be used for transformation of 
a suitable mammalian host cell. Transformation can be by any known method for 
introducing polynucleotides into a host cell, including, for example packaging the 
polynucleotide in a virus and transducing a host cell with the virus or by transfection 

1 0 procedures known in the art, as exemplified by U.S. Patent Nos. 4,399,216, 
4,912,040, 4,740,461, and 4,959,455 (these patents are incorporated herein by 
reference). The transformation procedure used depends upon the host to be 
transformed. Methods for introduction of heterologous polynucleotides into 
mammalian cells are known in the art and include dextran-mediated transfection, 

15 calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct 
microinjection of the DNA into nuclei. 

Mammalian cell lines available as hosts for expression are known in the art 
and include many immortalized cell lines available from the American Type Culture 

20 Collection (ATCC), including but not limited to Chinese hamster ovary (CHO) cells, 
HeLa cells, baby hamster kidney (BHK) cells, monkey kidney cells (COS), NlE-1 15 
(Liles et al., J. Biol. Chem. 261:5307-5313, 1986), PC 12 human hepatocellular 
carcinoma cells (e.g.. Hep G2), and a number of other cell lines, such as insect 
derived cell lines IF9 and IF21. Cell lines of particular preference are those expressing 

25 recombinant GBS toxin receptor constructs constitutively, lines which subsequently 
develop characteristics of a transformed cell, and lines which more preferably express 
GBS toxin receptor or fragments on the cell surface. Particularly preferred are ECV 
cells (a bladder carcinoma cell line originally referred to in the scientific literature as 
an endothelial cell line), human umbilical vein endothelial cells (HUVEC), bovine, 

30 sheep, and human adrenal medulla endothehal cells. 

Recombinant GBS toxin receptor or fragments thereof can be produced by 
culturing host cells expressing the receptor or fragment in a suitable culture medium 
and under appropriate cell culture conditions. Culture media and conditions are 
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variable depending on the requirements of a particular host cell line and are well 
known in the art. Typically, cells are cultured at 37°C in a cell culture incubator with 
a fixed amount of C02, usually in the range of 5-10%. 

In another embodiment, the polypeptide firagments can be synthesized 
5 chemically by techniques well known in the art, such as solid-phase peptide synthesis 
(Stewart et al., Solid Phase Peptide Synthesis, W.H. Freeman Co., San Francisco 
(1963)); Merrifield, J Am Chem Soc 85:2149-2154 (1963)). These and other methods 
of peptide synthesis are also exemplified by U.S. Patent Nos. 3,862,925, 3,842,067, 
3,972,859, and 4,105,602. The synthesis can use manual synthesis techniques or 

10 automatically employ, for example, an Applied BioSystems 430A or 431 A Peptide 
Synthesizer (Foster City, California) following the instructions provided in the 
instruction manual supplied by the manufacturer. It will be readily appreciated by 
those having ordinary skill in the art of peptide synthesis that the intermediates which 
are constructed during the course of synthesizing the present analog compounds are 

15 themselves novel and useful compounds and are thus within the scope of the 
invention. 

In addition to polypeptides consisting only of naturally-occurring amino acids, 
peptidomimetics are also provided. Peptide analogs are commonly used in the 
pharmaceutical industry as non-peptide drugs with properties analogous to those of 

20 the template peptide. These types of non-peptide compounds are termed "peptide 
mimetics" or "peptidomimetics" (Fauchere, J. (1986) Adv. Drug Res. 15: 29; Veber 
and Freidinger (1985) TINS p.392; and Evans et al. (1987) J. Med. Chem 30: 1229, 
which are incorporated herein by reference) and are usually developed with the aid of 
computerized molecular modeling. Peptide mimetics that are structurally similar to 

25 therapeutically useful peptides may be used to produce an equivalent therapeutic or 
prophylactic effect. Generally, peptidomimetics are structurally similar to a paradigm 
polypeptide (i.e., a polypeptide that has a biochemical property or pharmacological 
activity) but have one or more peptide linkages optionally replaced by a linkage 
selected from the group consisting of: -CH2NH., .CH2S-, .CH2-CH2-, -CH=CH- 

30 (cis and trans), -COCH2-, -CH(0H)CH2-, and -CH2S0-, by methods known in the art 
and further described in the following references: Spatola, A.F. in "Chemistry and 
Biochemistry of Amino Acids, Peptides, and Proteins," B. Weinstein, eds., Marcel 
Dekker, New York, p. 267 (1983); Vol. 1, Issue 3, "Peptide Backbone Modifications" 

(general review); Morley, J.S., Trends Pharm Sci (1980) pp. 463-468 (general 
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review); Hudson, D. et al., Int J Pept Prot Res (1979) 14:177-185 (-CH2NH-, 
CH2CH2.); Spatola, A.F. et al., Life Sci (1986) 38:1243-1249 (-CH2-S); Hann, 
M.M., J Chem Soc Perkin Trans I (1982) 307-314 (-CH-CH-, cis and trans); 
Almquist, R.G. et al., J. Med Chem (1980) 23:1392-1398 (-C0CH2-); Jennings- 
5 White, C. et al.. Tetrahedron Lett (1982) 23:2533 (-C0CH2-); Szelke, M, et al., 

European Appln. EP 45665 (1982) CA: 97:39405 (1982) (-CH(0H)CH2-); Holladay, 
M.W. et al., Tetrahedron Lett (1983) 24:4401-4404 (-C(0H)CH2-); and Hruby, V.J., 
Life Sci (1982) 31:189-199 (-CH2-S-); each of which is incorporated herein by 
reference. A particularly preferred non-peptide linkage is -CH2NH-. Such peptide 

10 mimetics may have significant advantages over polypeptide embodiments, including, 
for example: more economical production, greater chemical stability, enhanced 
pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered 
specificity (e.g., a broad-spectrum of biological activities), reduced antigenicity, and 
others. Labeling of peptidomimetics usually involves covalent attachment of one or 

15 more labels, directly or through a spacer (e.g., an amide group), to non-interfering 
position(s) on the peptidomimetic that are predicted by quantitative structure-activity 
data and/or molecular modeling. Such non-interfering positions generally are 
positions that do not fomi direct contacts with GBS toxin (e.g., are not contact points 
in the GBS toxin binding domain of the GBS toxin receptor). Derivitization (e.g., 

20 labelling) of peptidomimetics should not substantially interfere with the desired 
biological or pharmacological activity of the peptidomimetic. 

Systematic substitution of one or more amino acids of a consensus sequence 
with a D-amino acid of the same type (e.g., D-lysine in place of L-lysine) may be 
used to generate more stable peptides. In addition, constrained peptides comprising a 

25 consensus sequence or a substantially identical consensus sequence variation may be 
generated by methods known in the art (Rizo and Gierasch (1992) Ann. Rev. 
Biochem. 61: 387, incorporated herein by reference); for example, by adding internal 
cysteine residues capable of forming intramolecular disulfide bridges which cyclize 
the peptide. 

30 The invention also provides a complex comprising a GBS toxin boimd to a 

mammalian GBS toxin receptor or a firagment of a mammalian GBS toxin receptor. 
Preferably, the complex comprises a GBS toxin bound to a GBS toxin receptor 
polypeptide described above that can bind GBS toxin. Typically, a complex is 
formed by contacting a GBS toxin with such a polypeptide under conditions that 
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permit specific binding of the GBS toxin to the polypeptide. The GBS toxin can be 
labeled or unlabeled. The polypeptide can be present on the surface of a cell, or 
immobilized in a well or on a bead, or the polypeptide can be present in solution. 

5 Detection Methods 

Yet another aspect of the invention provides methods for detecting or 
monitoring a variety of medical conditions characterized by pathologic and/or 
hypoxia-driven angiogenesis or neovascularization. Examples include, but are not 
limited to, early onset disease in the neonate, and the progression of cancers involving 

1 0 tumors. 

Early onset disease can be diagnosed by detecting the presence or absence of 
GBS toxin in a patient. One method of detection is a competition assay that 
determines the effect of a suspected sample on the formation of a complex betv^een 
GBS toxin and a GBS toxin receptor or fragment thereof For example, the method 

1 5 comprises contacting a control GBS toxin with a GBS toxin receptor polypeptide, in 
the presence and absence of a sample suspected of containing GBS toxin and under 
conditions that permit specific binding of the GBS toxin to the polypeptide, and 
comparing the amount of complex formation achieved in the presence of the 
suspected sample to the amount of complex formation achieved in the absence of the 

20 suspected sample. Preferably, the control GBS toxin is substantially purified and of a 
known concentration. Preferably, the control GBS toxin further comprises a label. 
Suitable labels include, but are not limited to, radioisotopes, chromophores, 
fluorophores, biotin, avidin, and other labels used by one skilled in the art. Another 
method directly measures, rather than by competition with a control GBS toxin, 

25 complex formation between GBS toxin present in a suspected sample and a GBS 
toxin receptor polypeptide. 

Pathologic vasculature can be detected in a mammalian tissue by detecting the 
presence or absence of GBS toxin receptor in the region of a tumor, with the presence 
of GBS toxin receptor being indicative of the presence of pathologic vasculature. The 

30 method can be used to monitor tumor growth or metastasis. One method of detection 
involves the use of molecules, e.g. antibodies, that specifically bind to a GBS toxin 
receptor, preferably an extracellular domain of GBS toxin receptor. Typically, the 
method comprises administering, to a mammalian tissue, e.g. in a mammal having a 
cancerous tumor, e.g., an antibody that recognizes a GBS toxin receptor, and 
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detecting specific binding of the antibody. Typically, the antibody is a labeled 
antibody. Preferably, the observations are quantitative and can be visual. 

During surgery, the margin of a tumor can be visualized by any of a number of 
imaging techniques known in the art and described above. The imaging of the tumor 
5 is effected by detecting the binding of a labeled antibody or other molecules to the 
GBS toxin receptor on the pathologic vasculature of a tumor. This type of surgery is 
also known as virtual surgery because while performing the surgery, the surgeon 
views the tumor indirectly on an imaging screen. 



10 Drug Discovery 

A fourth aspect of the invention provides methods, using the polypeptides of 
the invention, of identifying drug candidates for the treatment of medical conditions 
characterized by hypoxia-driven angiogenesis or neovascularization. Preferred 
compounds are competitive inhibitors of GBS toxin binding to a GBS toxin receptor 

1 5 or inhibit GBS toxin receptor activity. Particularly preferred are compounds that 

inhibit the first phosphorylation step in the signal transduction pathway. Compounds 
can be produced by a variety of random drug design methods commonly known in the 
art, such as, for example, combinatorial chemistry (U.S. Patent No. 5,646,285; U.S. 
Patent No. 5,639,603), peptide libraries (U.S. Patent No. 5,591,646; U.S. Patent No. 

20 5,367,053; U.S. Patent No. 5,747,334), phage display (U.S. Patent No. 5,403,484; 
U.S. Patent No. 5,223,409), SELEX® (U.S. Patent No. 5,773,598; U.S. Patent No. 
5,763,595; U.S. Patent No. 5,763,566), and combinatorial carbohydrate chemistry 
(Hirschmann et al., J Med Chem (1996) 39:2441-2448; Hirschmann et al., J Med 
Chem (1998) 41 :1382-1391; Sofia MJ, Mol Divers (1998) 3:75-94; U.S. Patent No. 

25 5,780,603; U.S. Patent No. 5,756,712) 

An alternative approach is rational drug design with the intent of producing a 
GBS toxin mimetic or a GBS toxin receptor mimetic with improved therapeutic 
properties using techniques such as x-ray crystallography, nuclear magnetic resonance 
(NMR) correlation spectra (U.S. Patent No. 5,698,401), computer assisted molecular 

30 modeling (U.S. Patent No. 5,579, 250; U.S. Patent No. 5,612,895; U.S. Patent No. 
5,680,331, Cooper et al,. J. Comput.-Aided Mol. Design, 3:253-259 (1989); Brent et 
al., J. Comput.-Aided Mol. Design 2:31 1-310 (1988)) and other methods of rational 
drug design known in the art. FIG. 1 provides a broad overview of some of the main 
steps in some of the rational drug design methods of the present invention. For 
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example, one approach to rational drug design involves a computer program, such as 

INSIGHTII (available from Bisoym Technologies, 10065 Barnes Canyon Road, San 

Diego, California) to identify active sites in proteins by homology-based modeling. 

This method facilitates the modeling of a protein by using a similar protein whose 

5 structure is well known. Commercial software containing search algorithms for three 

dimensional database comparisons are available from vendors such as Day Light 

Information Systems, Inc., Irvine, California 92714, and Molecular Design Limited, 

2132 Faralton Drive, San Leandro, California 94577. 

In one embodiment, the compound can bind the GBS toxin receptor and 

10 induce an inflammatory response in a manner similar to the binding of GBS toxin to 

the GBS toxin receptor. Such compounds can be used, for example, as a drug to 

target an inflammatory response to the developing vasculature of a tumor. 

In another embodiment, the compound can bind the GBS toxin receptor with 

or without inducing an inflammatory response, preferably without inducing an 

15 inflammatory response. In one instance, the compound can be used as a vehicle to 

target pathological neovasculature for treatment with a cytotoxic agent. For example, 

the cytotoxic agent can be chemically coupled to the compound to form a chimeric 

drug. Such chimeric drugs can be used in the treatment of tumors, rheumatoid 

arthritis, wound healing, spinal cord injury, and other conditions characterized by 

20 hypoxia-driven angiogenesis or neovascularization. In another instance, the 

compound can be used directly to competitively inhibit binding of GBS toxin to a 

GBS toxin receptor. Such compounds can be used in the treatment of early-onset 

disease in the neonate. 

In a third embodiment, the compound can bind GBS toxin and can be used in 

25 the treatment of early-onset disease in the neonate. 

The polynucleotides of the invention can be expressed in random mutagenesis 

systems such as phage display or the yeast two-hybrid system for the synthesis and 

identification of mutant peptide GBS toxin receptor polypeptides that bind GBS toxin. 

Ahematively, inunobilized or soluble GBS toxin receptor fragments of the invention 

30 can be used to screen combinatorial peptide and combinatorial chemical libraries and 

non-random recombinant and synthetic peptides and other compounds (such as non- 

peptide molecules) for GBS toxin receptor binding. Compounds that bind GBS toxin 

or GBS toxin receptor can then be further characterized in a functional assay for any 

of the activities described above in order to identify a drug candidate for the treatment 
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of medical conditions involving angiogenesis or neovascularization. 

A compound whicK inhibits binding of GBS toxin to a GBS toxin receptor can 
be identified by combining a test compound with a mammalian GBS toxin receptor or 
fra^ent thereof capable of binding GBS toxin, under conditions that permit specific 
5 binding of GBS toxin to the GBS toxin receptor or fragment, and determining the 
amount of inhibition by the compound of the binding of GBS toxin to the GBS toxin 
receptor or fragment. 

In a preferred embodiment, the GBS toxin receptor or fragment is expressed 
by a cell, preferably on the cell surface. The cells are contacted with labeled GBS 

10 toxin in the presence or absence of the test compound. A change in the binding of 
GBS toxin to the GBS toxin receptor is then determined. AUematively, the GBS 
toxin is unlabeled and an antibody that recognizes GBS toxin is labeled instead. The 
labeled antibody is used to measure inhibition by a compound of GBS toxin binding 
to the GBS toxin receptor or fragment. In another embodiment, the GBS toxin 

15 receptor or fragment is not associated with a cell, but is instead coupled to a matrix, 
such as, for example, a well in a microliter plate or a bead. Additional suitable solid 
supports include latex, polystyrene beads (Interfacial Dynamics Corp. Portland, 
Oreg.), magnetic particles (Advanced Magnetics, Cambridge, Mass.) and nylon balls 
(Hendry et aL, J. Immunological Meth., 35:285-296, 1980). The receptor or fragment 

20 can be coupled to the matrix directly or indirectly through an antibody, coupled to the 
matrix, that binds the receptor fragment. In a third embodiment, the GBS toxin 
receptor or fragment is soluble and can be immunoprecipitated with an antibody that 
recognizes the receptor or fragment. 

A preferred method for identifying a compound which binds a mammalian 

25 GBS toxin receptor comprises the steps of (1) combining a test compound with a GBS 
toxin receptor or fragment thereof under conditions that allow specific binding to 
occur, and (2) detecting a complex formed between the test compound and the GBS 
toxin receptor or fragment. A preferred method is a competition assay which 
determines the ability of the test compound to compete for binding to the GBS toxin 

30 receptor or fragment. In such an assay, GBS toxin is combined with the GBS toxin 
receptor or fragment in the presence or absence of the test compound. Decreased 
specific binding of GBS toxin in the presence versus the absence of the test compound 
is indicative of the ability of the test compound to bind a mammalian GBS toxin 
receptor. Another method comprises combining a control compound with the GBS 
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toxin receptor or fragment under the same conditions as the test compound and 
comparing the amount of complex formation between the test compound or the 
control compound and the GBS toxin receptor or fragment thereof. Preferably, the 
test compound and/or the control compound are labeled. The test compound can be 
5 any of a number of classes of compounds, such as for example, small organic 
molecules (such as those used for and obtained by combinatorial chemistry), 
polysaccharides, polypeptides, RNA, antibodies, and single chain antibodies. In a 
preferred embodiment, the polypeptide is expressed by a cell, preferably on the 
surface of the cell, and preferably by a stable transfected cell. Such a system is 

10 particularly useful for testing the effectiveness of a chimeric compound comprising a 
cytotoxic agent. The cytotoxic activity of the compound can be determined by 
exposing a cell expressing the GBS toxin receptor on the cell surface to the test 
chimeric compound and detecting signs of cytotoxicity. One could detect such signs 
by a viability stain of the cell, by detecting apoptosis (for example, by a DNA ladder 

1 5 assay or a TUNEL™ stain, which binds to broken DNA), by measuring tritiated 
thymidine incorporation into the cell, and by quantitating kinase-dependent 
phosphorylation (e.g., using phosphoantibodies or various phosphoimaging 
techniques). 

In another embodiment, the invention provides a method for identifying an 

20 inhibitor of GBS toxin receptor. The method comprises incubating test cells in the 

presence and absence of a test compound. The test cells express GBS toxin receptor 

or a fragment thereof having GBS toxin receptor activity (e.g., a fragment that 

increases the proliferation or migration of the expressing cells relative to control cells 

of the same cell type that do not express the fragment). The test cells are incubated 

25 tmder conditions in which the cells incubated in the absence of the test compound can 

proliferate or migrate. Control cells that do not express the GBS toxin receptor or 

fragment proliferate or migrate less than cells that express the GBS toxin receptor or 

fragment. The proliferation or migration (also referred to herein as motility) of the 

test cells incubated in the presence or absence of the test compound is compared. Less 

30 proliferation or migration in the presence of the test compound than in the absence of 

the test compoimd is indicative of the test compound being an inhibitor of the GBS 

toxin receptor. Preferably, as a control to determine whether the test compound 

specifically inhibits the GBS toxin receptor, the proliferation or migration of control 

cells in the presence and absence of the test compound is also compared. In the 
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absence of a difference in the proliferation or migration of control cells incubated in 
. the presence or absence of the test compound, decreased proliferation or migration in 
test cells exposed to the test compound relative to test cells not exposed to the test 
cornpound is indicative of specific inhibition of the GBS toxin receptor. It will be 
5 readily apparent that the control portions of the method need not be performed 

contemporaneously with the test portions of the method. For example, control cells 
can be incubated with a battery of test compounds to determine cellular effects of the 
test compounds prior to incubating the test cells with the test compounds. Motility or 
migration can be determined by detecting movement of cells on a culture dish, 

10 Proliferation can be detected in a number of ways, including, but not limited to, 
measuring tritiated thymidine incorporation, cell counts, apoptosis assays, and 
viability assays. Preferred cells include cells transfected with GBS toxin receptor, 
preferably endothelial cells transfected with GBS toxin receptor, even more 
preferably vascular endothelial cells or microvascular endothelial cells. Primary cells 

1 5 that express GBS toxin receptor are also preferred, for example, endothelial cells that 
have been passaged in cell culture, at confluence, no more than 8 or 9 times. A 
preferred class of test compounds includes kinase inhibitors, preferably cAMP- 
dependent kinase inhibitors, PKC inhibitors, and CK2 inhibitors, which can be used 
as a starting point for developing more specific GBS toxin receptor inhibitors. 

20 Another class of compounds includes antibodies specific for GBS toxin receptor. 
Particularly preferred are single chain antibodies, preferably a collection of single 
chain antibodies that recognize various epitopes on the GBS toxin receptor. Less 
preferred are divalent antibodies specific for the binding site of the GBS toxin 
receptor ligand because they may trigger the signal transduction cascade upon 

25 dimerization. 

Another embodiment of the invention is a method of identifying an inhibitor 
of endothelial cell proliferation or migration, which are essential components of 
angiogenesis. The method basically comprises the steps described in the preceding 
paragraph and uses endothelial cells. 

30 Yet another embodiment of the invention is a method of identifying a 

therapeutic compound for the treatment or prevention of a medical condition 
characterized by pathologic or hypoxia-driven angiogenesis or neovascularization. 
The method basically comprises the steps described above and uses cells from tissues 
derived fit)m mammals afflicted with the medical condition or cells that serve as a 
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model for afflicted tissue. 

A preferred method for designing a compound which inhibits binding of a 
GBS toxin to a mammalian GBS toxin receptor comprises (1 ) simulating and 
selecting the most probable conformations of a GBS toxin receptor or fragment 
5 thereof, (2) designing a chemically modified analog that substantially mimics the 
energetically most probable three-dimensional structure of the GBS toxin receptor or 
fragment, (3) chemically synthesizing the analog, and (4) evaluating the bioactivity of 
the analog. Preferably, steps (a) and (b) are performed with the aid of a computer 
program. 

10 A preferred method for designing a compound which binds to a mammalian 

GBS toxin receptor comprises (1) simulating and selecting the most probable 
conformations of a GBS toxin receptor or fragment thereof, (2) deducing most 
probable binding domains of the receptor or fragment, (3) designing a compound that 
would form the energetically most probable complexes with the receptor or fragment, 

1 5 (4) chemically synthesizing the compound, and (5) evaluating the bioactivity of the 
compound. Preferably, steps (a)-(c) are performed with the aid of a computer 
program. 

Preferred polypeptides for use in the screening assays described above are 
polypeptides sharing at least about 85% identity, preferably at least about 95% 
20 identity, and most preferably greater than about 99% identity with the amino acid 

sequence of SEQ ID NO: 2 or SEQ ID NO: 4, or a fragment thereof having GBS toxin 
receptor activity. Most preferred are polypeptides having an amino acid sequence of 
SED ID NO: 2, 4 OR 8 or a fragment thereof having GBS toxin receptor activity. 

25 Methods Of PuRincATiON 

Another aspect of the invention is a method for purifying a compound that 
binds a GBS toxin receptor, for example, natural ligand, other polysaccharides, or an 
antibody specific for the GBS toxin receptor. The method comprises providing a 
polypeptide comprising a mammalian GBS toxin receptor or fragment thereof that 

30 binds GBS toxin, contacting the polypeptide with a sample comprising the compound 

imder conditions that allow specific binding of the compound to the polypeptide, and 

separating the bound compoimd from the remainder of the sample. The polypeptide 

can be soluble but preferably is immobilized on a substrate e.g., on a bead, membrane 

or on the surface of a cell, preferably a stable transfected cell. 
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Methods of Treatment 

GBS toxin receptor polypeptides and antibodies that interfere with GBS toxin 
binding can be used in a method of treatment of the human or animal body. For 
example, such inhibitors of GBS toxin binding can be administered to a patient to 
5 treat or prevent medical conditions involving GBS toxin binding to a GBS toxin 
receptor, such as, for example, early onset disease in the neonate. 
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GBS toxin mimetics or other compounds that bind and/or inhibit GBS toxin 

receptor, some of which can be identified by the drug discovery assays of the 

invention, can be used in a method of treatment of the human or animal body or can 

be used for the manufacture of a medicament for the treatment or prevention of any 

5 of a number of medical conditions involving pathologic and/or hypoxia-driven 

angiogenesis, such as, for example, cancerous tumors, chronic inflammatory diseases, 

scarring during wound healing or repair of neural injury. 

In a preferred embodiment, such a compound exerts its therapeutic effect by 

binding GBS toxin receptor and evoking an inflammatory response, as does GBS 

10 toxin. Preferably, such compounds comprise a sulfliydryl, hydroxyl, or amino group 

displayed so as to be available for binding complement C3. 

In another preferred embodiment, the compound is an inhibitor of GBS toxin 

activity. Preferred inhibitors include, but are not limited to, kinase inhibitors, single 

chain antibodies specific for the GBS toxin receptor, and antisense polynucleotides 

15 that specifically hybridize imder high stringency conditions to a GBS toxin receptor 

nucleic acid sequence, such as that of SEQ ID N0:1, SEQ ID NO:3 or SEQ ID NO:7. 

In another preferred embodiment, the compound exerts its therapeutic effect 

without evoking an inflammatory response. The compound can be used to deliver a 

cytotoxic agent to tissue in close proximity to a cell expressing a GBS toxin receptor, 

20 such as, for example, a tumor undergoing angiogenesis. Preferably, the compound is 

covalently attached to a cytotoxic agent and can be associated non-covalently with a 

cytotoxic agent, such as, for example, on the external surface of a liposome, micelle, 

or other lipophilic drug encapsulating structure. Preferred cytotoxic agents include 

antineoplastic agents commonly known in the art, such as, for example, 

25 mechlorethamine, chlorambucil, cyclophosphamide, melphalan, ifosfamide, and other 

alkylating agents, methotrexate and other folate antagonists, 6-mercaptopurine and 

other purine antagonists, 5-fluorouracil and other pyrimidine antagonists, cytarabine, 

ovinblastine, vincustine, and other vincas, etoposide and other podophyllotoxins, 

doxorubicin, bleomycin, mitomycin, and other antibiotics, carmustine, lomustine and 

30 other nitrosureas, cisplatin, interferon, asparaginase, tamoxifen, flutamide, and taxol. 

Other preferred biologic agents include sense and/or antisense RNA or DNA 

sequences derived fi-om specific tumor promoter or suppressor genes, such as, for 

example, the p53 and TGF gene families, signal transduction protein family members 

such as, for example, ras and myc, and growth factor receptor kinases such as, for 
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example flt2 and flkl, Tail, Tai2, and neuropholin, and other genes implicated in 
neoplastic disease and other diseases driven by pathologic angiogenesis. 



_ In another embodiment, GBS toxin receptor polypeptide or fragment thereof 
can be administered to a subject as a decoy to reduce the amount of stimulation of the 
5 GBS toxin receptor present in afflicted tissues (e.g., tumor tissues), thereby reducing 
cellular responses leading to proliferation and migration of cells of the afflicted 
tissues. Preferably, the GBS toxin receptor polypeptide or fragment is administered in 
soluble form, even more preferably sans transmembrane domains. 

10 Pharmaceutical Compositions 

Polypeptides of the invention that comprise a domain essential for GBS toxin 
binding that have the desired characteristics for bioavailability, stability and other 
important parameters of pharmacokinetics in vivo can be used as a competitive 
inhibitor of GBS toxin binding for medical conditions, such as, for example, early 

15 onset disease in the neonate, in which GBS toxin binding is undesirable. Appropriate 
polypeptides can include fragments having an amino acid sequence corresponding to 
a partial or fiill sequence of SEQ ID NO: 2 or SEQ ID NO: 4 or analogs thereof. 

Compoimds determined by assays using the polypeptides of the invention to 
bind and/or GBS toxin receptor and/or induce an inflammatory response, and that 

20 have the desired pharmacokinetic characteristics, can be used as treatments for 
medical conditions in which GBS toxin binding can be therapeutic, such as, for 
example, medical conditions involving pathologic or hypoxia-driven angiogenesis or 
neovascularization. 

Pharmaceutical compositions of the invention include a pharmaceutically 

25 acceptable carrier that may contain a variety of components that provide a variety of 
functions, including regulation of drug concentration, regulation of solubility, 
chemical stabilization, regulation of viscosity, absorption enhancement, regulation of 
pH, and the like. For example, in water soluble formulations the pharmaceutical 
composition preferably includes a buffer such as a phosphate buffer, or other organic 

30 acid salt, preferably at a pH of between about 7 and 8. Other components may 
include antioxidants, such as ascorbic acid, hydrophilic polymers, such as, 
monosaccharides, disaccharides, and other carbohydrates including cellulose or its 
derivatives, dextrins, chelating agents, such as EDTA, and like components well 
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known to those in the pharmaceutical sciences, e.g. Remington's Pharmaceutical 
Science, latest edition (Mack Publishing Company, Easton, PA). 

An effective amount of an active compound such as a GBS toxin receptor 
polypeptide, mimetic or analog, or GBS toxin mimetic or analog for particular 
5 applications depends on several factors, including the chemical nature of the 
polypeptide, mimetic or analog, the disorder being treated, the method of 
administration, and the like. Preferably, an effective amount will provide a 
concentration of polypeptide or mimetic of between about 0.0001 to 100 /mi at the 
target GBS toxin receptor on a cell surface, more preferably less than 10 /m/, with less 

10 than 1 jUM being most preferred. 

The active compoxmd can be administered to a mammalian host in a variety of 
forms, i.e., they may be combined with various pharmaceutically acceptable inert 
carriers in the form of tablets, capsules, lozenges, troches, hard candies, powders, 
sprays, elixirs, syrups, injectable or eye drop solutions, and the like depending on the 

1 5 chosen route of administration, e.g., orally or parenterally. Parenteral administration 
in this respect includes administration by the following routes: intravenous, 
intramuscular, subcutaneous, intraocular, intrasynovial, transepithelial (including 
transdermal, ophthalmic, sublingual and buccal), topical (including ophthalmic, 
dermal, ocular, rectal, nasal inhalation via insufflation and aerosol), and rectal 

20 systemic. 

The active compotmd may be orally administered, for example, with an inert 
diluent or with an assimilable edible carrier, it may be enclosed in hard or soft shell 
gelatin capsules, compressed into tablets, or incorporated directly with the food of the 
diet. For oral therapeutic administration, the active compound may be incorporated 

25 with excipient and used in the form of ingestible tablets, buccal tablets, troches, 
capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions and 
preparations should contain at lease 0.1% of active compound. The percentage of the 
compositions and preparations may, of course, be varied and may conveniently be 
between about 2% to about 6% of the weight of the unit. The amoxmt of active 

30 compoimd in such therapeutically usefiil compositions is such that a suitable dosage 
will be obtained. Preferred compositions or preparations according to the present 
invention are prepared so that an oral dosage imit form contains between about 1 and 
1000 mg of active compound. 
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Tablets, troches, pills, capsules and the hke may also contain the following: a 
binder such as polyvinylpyrrolidone, gum tragacanth, acacia, sucrose, com starch or 
gelatin; an excipient such as calcium phosphate, sodium citrate and calcium 
carBonate; a disintegrating agent such as com starch, potato starch, tapioca starch, 

5 certain complex silicates, alginic acid and the like; a lubricant such as sodium lauryl 
sulfate, talc and magnesium stearate; a sweetening agent such as sucrose, lactose or 
saccharin; or a flavoring agent such as peppermint, oil of wintergreen or cherry 
flavoring. Solid compositions of a similar type are also employed as fillers in soft and 
hard-filled gelatin capsules; preferred materials in this connection also include lactose 

10 or milk sugar as well as high molecular weight polyethylene glycols. When the 
dosage unit form is a capsule, it may contain, in addition to materials of the above 
type, a liquid carrier. Various other materials may be present as coatings or to 
otherwise modify the physical form of the dosage unit. For instance, tablets, pills, or 
capsules may be coated with shellac, sugar or both. A syrup or elixir may contain the 

15 active compound, sucrose as a sweetening agent, methyl and propylparabens as 
preservatives, a dye, flavoring such as cherry or orange flavor, emulsifying agents 
and/or suspending agents, as well as such diluents as water, ethanol, propylene glycol, 
glycerin and various combinations thereof. Of course, any material used in preparing 
any dosage unit form should be pharmaceutically pure and substantially non-toxic in 

20 the amounts employed. In addition, the active compound may be incorporated into 
sustained-release preparations and formulations. 

The active compound may also be administered parenterally or 
intraperitoneally. For purposes of parenteral administration, solutions in sesame or 
peanut oil or in aqueous propylene glycol can be employed, as well as sterile aqueous 

25 solutions of the corresponding water-soluble, alkali metal or alkaline-earth metal salts 
previously enumerated. Such aqueous solutions should be suitably buffered, if 
necessary, and the liquid diluent first rendered isotonic with sufficient saline or 
glucose. Solutions of the active compoimd as a fi-ee base or a pharmacologically 
acceptable salt can be prepared in water suitably mixed with a surfactant such as 

30 hydroxypropylcellulose. A dispersion can also be prepared in glycerol, liquid 

polyethylene glycols and mixtures thereof and in oils. Under ordinary conditions of 

storage and use, these preparations contain a preservative to prevent the growth of 

microorganisms. These particular aqueous solutions are especially suitable for 

intravenous, intramuscular, subcutaneous and intraperitoneal injection purposes. In 
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this connection, the sterile aqueous media employed are all readily obtainable by 
standard techniques well-known to those skilled in the art. 

The pharmaceutical forms suitable for injectable use include sterile aqueous 
solutions or dispersions and sterile powders for the extemporaneous preparation of 
5 sterile injectable solutions or dispersions. In all cases the form must be sterile and 
must be fluid to the extent that easy syringability exists. It must be stable under the 
conditions of nlanufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. The carrier can 
be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for 

10 example, glycerol, propylene glycol, liquid polyethylene glycol and the like), suitable 
mixtures thereof, and vegetable oils. The proper fluidity can be maintained, for 
example, by the use of a coating such as lecithin, by the maintenance of the required 
particle size in the case of a dispersion and by the use of surfactants. The prevention 
of the action of microorganisms can be brought about by various antibacterial and 

15 antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, 

thimerosal and the like. In many cases it will be preferable to include isotonic agents, 
for example, sugars or sodium chloride. Prolonged absorption of the injectable 
compositions can be brought about by use of agents delaying absorption, for example, 
aluminum monostearate and gelatin. 

20 Sterile injectable solutions are prepared by incorporating the active compound 

in the required amount in the appropriate solvent with various of the other ingredients 
enumerated above, as required, followed by filtered sterilization. Generally, 
dispersions are prepared by incorporating the sterilized active ingredient into a sterile 
vehicle which contains the basic dispersion medium and the required other ingredients 

25 from those enumerated above. In the case of sterile powders for the preparation of 
sterile injectable solutions, the preferred methods of preparation are vacuum drying 
and the freeze drying technique which yield a powder of the active ingredient plus any 
additional desired ingredient fh)m the previously sterile-filtered solution thereof. 
For purposes of topical administration, dilute sterile, aqueous solutions 

30 (usually in about 0.1% to 5% concentration), otherwise similar to the above parenteral 

solutions, are prepared in containers suitable for drop-wise administration to the eye. 

The compounds of this invention may be administered to a mammal alone or in 

combination with pharmaceutically acceptable carriers. As noted above, the relative 

proportions of active ingredient and carrier are determined by the solubility and 
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chemical nature of the compound, chosen route of administration, the particular 
compound chosen and the physiological characteristics of the particular patient under 
treatment. 

5 Kits 

Yet another aspect of the invention is a kit for use in carrying out any of the 
above methods. A preferred embodiment is a kit comprising a GBS toxin receptor or 
fragment thereof. Preferably, the receptor or fragment is immobilized. A preferred 
kit can be used for identifying a compound that binds to GBS toxin receptor, and 

10 comprises at least one cell that expresses GBS toxin receptor. 

Another embodiment is a kit for monitoring tumor growth or metastasis, 
comprising a reagent for detecting expression of a GBS toxin receptor. Examples of 
such reagents include, but are not limited to, polynucleotide probes that hybridize to a 
GBS toxin receptor nucleic acid sequence and compounds that bind to a GBS toxin 

15 receptor, such as, for example, an antibody that specifically recognizes GBS toxin 
receptor, a GBS toxin, a GBS toxin mimetic, or other compounds identified by the 
screening methods described above. 
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A third embodiment is a kit for purifying a compound that binds a GBS toxin 
receptor, comprising a GBS toxin receptor or fragment thereof that binds the 
compound. Preferred compounds include GBS toxin, GBS toxin mimetics, antibodies 
that specifically bind GBS toxin receptor, and other compounds identified by the 
5 screening methods described above. 

Additional kit components can include, but are not limited to, additional 
reagents required for detection, a reference standard(s), instructions for use, and the 
like. Suitable reference standards include positive controls, negative controls, 
photographs of such controls, tabulated or graphed data of such controls, and the like. 
10 The kits may further comprise instructions for carrying out the methods described 
above, preferably printed instructions. 

Example 1 - Cloning Sheep GBS Toxin Receptor 

15 Primary CMlturQ of ?hegp \mg endQthelial gell? 

Small pieces of primary lung tissues from a 7-week old sheep are cut 
into small pieces in Hank's balanced salt solution (HBSS) containing 10 mM HEPES 
buffer (Life Technology), 1% penicillin/streptomycin and 0.1% gentamycin, and are 
cultured in sheep lung complete medium (Life Technology) at 37°C. After one week 

20 of the culture, clones of sheep lung endothelial cells are identified by Cobblestone 
morphology and harvested into 24-well tissue culture plates (Falcon) using cloning 
rings. When the cells are confluent, they are detached by pancreatin and transferred 
to a 60-nim tissue culture Petri dish or a T-25 tissue culture flask (Falcon). When 
they are confluent again, they are split and cultured into a few lOO-nmi tissue culture 

25 plates (Falcon). Each split is considered to be one passage. The same procedure is 
repeated imtil enough cells (-^10^) are obtained for isolation of mRNA. 

Isolation of mR NA and construction of cDNA library 
Poly(A)+ RNA is isolated from 9,2 x 10^ sheep lung endothelial cells 
(passage 8 and 9) by a standard method (Pharmacia). A total of 1 6 |ig poly( A)"*^ RNA 

30 is acceptable amount obtained. 2.5 jig mRNA can be used to construct a cDNA 

library. Poly(A)'' RNA is oligo(dT)-primed (with Not I restriction site) and converted 
into double-stranded cDNA. After adding a BstXVEcoR I adaptor, the cDNA is 
unidirectionally cloned into the BstXl and Not I sites of pCDNA3. 1 (+) (Invitrogen). 
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£, coli ToplOF' (Invitrogen) is used as a host strain for amplification. 5.38 x 10** 
primary clones are an acceptable number generated. The library is amplified by 
plating cells onto fifty large LB agar plates containing ampicillin (100 |ig/ml). The 
plates are scraped and aliquoted so that each aliquot represents 10 plates. DNA is 
5 purified by Qiagen Max columns (Qiagen). 

Screening of cDNA library for a gene encoding GBS t oxin receptor 
To screen a cDNA library for a gene encoding GBS toxin receptor 
gene, a unique colorimetric method is used. Five i^g plasmid DNA fi-om each pool of 
cDNA library is used to transfect C0S7 cells. The transfected cells are cultured in 
10 four to eight 96-well tissue culture plates (Falcon) for transient expression. Each well 
contains about 20,000 transfected cells in DMEM medium (Life Technology). C0S7 
cells transfected with pCDNA3.1(+) are used as a control. After 3 days expression, 
the medium is carefully removed. Each well is rinsed 3 times with HPSS buffer 
containing Mg"^ and Ca^"*" (wash buffer) (Life Technology). 
15 The cells are then incubated with biotinylated toxin (50/// per well; 1 to 

1.5 //g/ml) at room temperatxire for 1 h. After the hour incubation, the biotinylated 
toxin is discarded and the wells are rinsed 3 times with the wash buffer. The cells are 
incubated with streptavidin-6-gal solution and each well is rinsed 3 times with the 
wash buffer. The cells are then incubated with PNPG (50 |il per well; 1 mg/ml in 
20 substrate buffer) at 37*^C. Absorbance at 405 nm is measured by an ELISA reader at 
1 and 20 h, respectively. The cells which give the highest OD are harvested. Plasmid 
DNA is isolated by Hirt extraction. Plasmid DNA is amplified in E. coli to have 
enough DNA for the next transfection (enrichment). 

Enrichment is done 8 times by this colorimetric method. The number 
25 of the transfected cells loaded into each well is gradually decreased in the last few 
enrichments and untransfected cells are added to each well to give a total number of 
20,000 cells per well for the cells to be confluent and to reduce backgroimd after 3 
days' expression. At the last enrichment, each well has only 1 to 10 transfected cells. 
Cells giving the highest OD are harvested. DNA is isolated and amplified in E. coli. 
30 A number of isolated clones are individually assayed by this 

colorimetric method. The clones which showed higher binding to CMlOl are 
sequenced. 

Sequence analysis 
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DNA sequence analysis of clone pFU102, which has a 2.1kb insert, 
revealed a sequence encoding a partial integral glycoprotein. N-terminal sequence 
was obtained by 5 'RACE method (Life Technology) and a full-length gene is 
designated as SP55. Triple ligation yielded pCD55, which contains an entire coding 
5 region of SP55. 

mRNA for the SP55 has 2844 nucleotides, encoding a protein of 495 amino 
acids with a predicated mass of 55 KDa, SP55. Analysis by the method of Klein et al. 
(Klein et al., Biochim Biophys Acta. 815:468-476 (1985)) classifies SP55 as an 
integral protein with seven transmembrane segments. SP55 has both N-glycosylation 
10 and kinase phosphorylation sites. A Swiss-Prot. search of SP55 did not reveal any 
high homology to known human proteins. However, SP55 has some identity (- 30%) 
to renal sodium-dependent phosphate transporters from human, rabbit, mouse and rat. 
In addition, SP55 has some identity 30 to 39%) to hypothetical proteins (HYP50 
and HYP63) from C. elegans, 

15 

Example 2 - Cloning Human GBS Toxin Receptor 

The sheep GBS toxin receptor sequence shares about 37% identity 
with HYP50 and about 33% identity to HYP63, two hypothetical proteins from C 
elegans. In the regions corresponding to amino acid residues 1 80-1 86 and 443-449 of 

20 SEQ ID No. 2. five amino acids within a seven amino acid stretch are absolutely 
conserved among the three proteins. 

A first degenerate oHgonucleotide, CMR3-S: 5'- 
CGGGATCCCGCCNGCNATGCAYRSHRTSTGG-3'(SEQ ID No. 5), was designed 
to include all possible codons encoding the amino acid sequences of SP55, HYP50, 

25 and HYP63 in the 180-186 region. A second degenerate oligonucleotide, CMR4- 
AS2: 5'-GGAATTCCDGGDGCRATKTCNARRTRRTT-3' (SEQ ID No. 6), was 
designed to include the complementary sequences of all possible codons encoding the 
amino acid sequences of SP55, HYP50, and HYP63 in the 443-449 region. 

Polymerase chain reaction (PGR) was conducted using these 

30 oligonucleotides and a human embryo lung cDNA library as a template. The reaction 
yielded three overlapping sequences approximately 400 bp in size, which encompass 
part of the nucleic acid sequence of SEQ ID No. 3. These sequences were then used 
as probes to clone the remainder of the gene, referred to herein and HP59 (SEQ ED 
NO: 7). 
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Example 3 - Preparation of Antibodies Against GBS Toxin Receptor 

Rabbits are immunized with the synthetic peptides shown in Table S. 
A hmg/ml solution of peptide plus KLH in O.OIM phosphate buffer is prepared. For 
5 the first immunization, 200 |ig of peptide plus KLH (200 ^il) and an equal volume of 
Freunds complete adjuvant, emulsified well before injection, is injected into 3-4 spots 
on the dorsal surface about the neck and shoulders of a rabbit. After two weeks, the 
second immunization (boost) is given at the same concentration of immunogen, but 
emulsified in Freunds incomplete adjuvant. The boost is delivered in the same region 
10 of the body. After another two weeks, blood is collected and assayed by ELISA for 
response against the peptide without KLH. Further boosts are given to improve 
antibody titer, if necessary. 

Table 8 

15 Immunogenic Peptides 



Peptide Amino Acid Sequence Sjzg 

p56a APSDGEEGSDRTPLLQRAPRAEPAPVC 27 aa 



SEP ID Ref. 



p55a 
p57a 



LAPSDGEEGSDRTPL 



NTTAKDNRTSYECA 



15 aa 



14 aa 



residues 8-35 of 
SEQ ID N0:4 

residues 7-22 of 
SEQ ID NO: 4 

residues 71-84 of 
SEQ ID NO: 4 

Peptide p55 is a fragment of an extracellular domain of GBS toxin receptor. 
Peptide p57a is a fragment of an intracellular domain of GBS toxin receptor. Animals 
immunized with these peptides produce polyclonal antibodies Pab55 and Pab57, 
20 respectively. 



Example 4 - Detection of GBS Toxin Receptor Expression in Tumor 
Cells 

This example shows that GBS toxin receptor can be detected in tumor cells. 
25 Immunohistochemistry is performed on paired human and mouse tissues of normal or 
tumor origin, using rabbit polyclonal antibodies Pab 55 and Pab 57. 
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Mouse and human tumor tissues are fixed in 10% neutral formalin. The 
tissues are then dehydrated, paraffin embedded and 10-20 x 8-micron sections are cut 
for immunohistochemical staining. 

- Immunohistochemical analysis is performed with the automated Ventana 
5 Immunohistochemical Stainer according to the manufacturer's suggested protocol 
(Ventana, Tucson, Arizona). Sections are deparaffinated with xylene. The prepared 
sections are then treated with 1% hydrogen peroxide prepared in 30% aqueous 
methanol for 20 minutes at room temperature to quench endogenous peroxidase 
activity. The slides are then washed with PBS, blocked with 5% BSA and 5% goat 
10 serum in PBS, washed again and then incubated for 30 minutes at 37*^0 with the 

appropriate diluted (1:100) antibody. Horseradish peroxidase-labeled goat anti-rabbit 
IgG is used as a secondary antibody. For visualization, the sections are incubated 
with DAB/H2O2. The sections are finally incubated with a copper enhancer (Ventana) 
for 4 minutes, washed, counterstained with hematoxylin, and mounted in toluene- 
1 5 minus mounting medium. Photographic documentation is performed and images are 
stored for later review and analysis. The results are summarized in Table 9. The 
numbers refer to glass slides. 

TABLE 9 

20 Immunohistochemistry of tumor and normal tissues 

(diff. = differentiated) 



Human tissues: 





Antibody 


Magnification 


Signal 


1. Ovary tumor ( 95-02VO16) high grade 


Pab 55 


400x 


+ 


papillary carcinoma 








2. Ovary tumor ( 95-02VO16) high grade 


Pab 55 


400x 


+ 


papillary carcinoma 








3. Nomial ovary ( 96-08ZO08) control 


Pab 55 


400x 




tissue 








4. Ovary tumor ( 95-02VO16) high grade 


Pab 57 


400x 


+ 


papillary carcinoma 








5. Ovary tumor ( 9 5 -02 VO 16) high 


Pab 57 


400x 


+ 


grade papillary carcinoma 








6. Ovary tumor ( 95-02VO16) high grade 


Pab 57 


400x 


+ 


papillary carcinoma 








7. Nornial ovary 96-08ZO08) control 


Pab 57 


400x 




8. Colon cancer 95-14664) poorly diff. 


Pab 55 


400x 


+ 


Adenocarcinoma 








9. Normal colon 9708VO08) control 


Pab 55 


400x 




10. Colon cancer 95-14664) poorly diff. 


Pab 57 


400x 





Adenocarcinoma 
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Pab 57 


400x 




Aaenocarcinonia 








il. fNormai coion y/uovL^uo^ conrroi 


r aU J I 






a. r€maie Dreasi cancer y/-ivjvuja) 


r aD J J 




+ 


invaSlvc TTiuiTirnary Larcinujna 








1-T. iviaic DiCdsi Cancer ^^rio cuul^ 


Pah 
raU 




1 


rnarniTiary carciiiuiiia 








1^ 'WnrmQl fpmnlp hrpa«:t 07- 1 9Vn?n-'^ ^ 


Pab 55 


400x 

*T V/ V A 




control 








10, remaie oreasi cancer y/-itjvuja; 


PaK ^7 

r ao J / 




+ 


Invasive manunary carcinoma 








17. Male breast cancer (no code) 


rao D / 




+ 


mammary carcinoma 








lo, iNormai lemaie oreasi \y / ' izvyjzv-j) 


Pah S7 


HUWa 




control 








ly. i^ung cancer yy/' i vjvuzz-j^ poony 


Pah 


HUva 


1 

-r 


diff. NOJ-small cell carcinoma 








'70 Normal Inner ( QR-0 1 VO 1 H control 


Pab 55 






21. Lung cancer ( 97-10VO22-5) poorly 


Pab 57 


400x 




diff. NOJ-small cell carcinoma 








22. Lung cancer ( 97-10VO22-5) poorly 


Pab 57 


400x 


+ 


diff. NOJ-small cell carcinoma 








23. Normal lung ( 98-0 1 VO 1 1) control 


Pab 57 







Mouse Tissues: 



Antibody Magnification Signal 



24. Madison Lung Tumor (MLT) untreated Pab 55 + 
with CM 101 

25 . MLT untreated with CM 1 0 1 Pab 55 + 

26. Normal mouse lung Pab 55 - 

27. MLT untreated with CM 1 0 1 Pab 57 + 

28. Normal mouse lung Pab 57 

The Pab 55 antibody stains the cells lining a blood vessel in a human ovary 

cancer tissue section, but such staining is not apparent in cells of normal human ovary 
tissue (see FIG. 2A and 2B, respectively). Similar results are obtained with the Pab 
5 57 antibody (see FIG. 3A and 3B). As shown in the above table and in FIGS. 2A-3B, 
antibodies raised to GBS toxin receptor fragments specifically bound to tumor tissues 
but not normal tissues, suggesting that GBS toxin receptor is expressed in tumor cells 
but not normal cells. 



1 0 Example 5- Detection of GBS Toxin Receptor Expression in Mice 

AFFLICTED WITH RHEUMATOID ARTHRITIS 

This example shows that GBS toxin receptor can be detected in cells from a 
mammalian model for rheumatoid arthritis (RA). Mice with collagen-induced 
arthritis were treated with CM 101 or carrier. CMlOl reversed the inflammatory 
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damage and inhibited pannus formation. Mouse #8 and #15, which were treated with 
CMIOL and two control mice (not treated with CM 101) were sacrificed for 
immunohistochemistry. 



TABLE 10 

Immunohistochemistry of Rheumatoid Arthritic Mice 

29. No CM 101 Pab55 + 

30. MOUSE 8 - 5' (vessel) Pab 55 + 

31. No CM 101 Pab 57 -f 

32. MOUSE 15 - 5' (vessel) Pab 57 + 

33. MOUSE 8 -5' (between joint) Pab 57 + 

34. MOUSE 15-5' Pab 57 + 

35. No CM 101 (marrow) Pab 57 + 

36. MOUSE 15-5' (marrow) Pab 57 + 



As shown above Pab55 and Pab57 specifically bound to pathologic 
neovasculature in the pannus, suggesting that GBS toxin receptor is expressed in mice 
10 afflicted with rheumatoid arthritis. No binding of CMlOl was observed in the normal 
neovasculature in the growth plate of the joints of the arthritic mice. 

Example 6 - Targeted Delivery of a Chimeric Compound to Tissues 
Expressing GBS Toxin Receptor 

1 5 This example shows the targeted delivery of a chimeric compound to tissues 

expressing GBS toxin. The chimeric compound is a CMlOl-biotin conjugate. Mice 
with Madison Lung Tumors (MLT) are infused intravenously (i.v.) with biotinylated 
CM 101. 

CM 101 has been reacted with hydrazinylated biotin to form the biotin 
20 hydrazone at the reducing end of the polysaccharide CM 101. Briefly, 25 micrograms 
of lyophilized CMlOl is dissolved in 250 \il labeling buffer at 100 mM sodium 
acetate, 0.02% sodium azide. Aqueous meta-periodate (125 \xl of 30 mM) is added 
and the oxidation is allowed to proceed in the dark for 30 minutes at room 
temperature. The reaction is terminated by adding 80 mM NajSOa to the solution. 
25 The resultant aldehydes are reacted with 125 |il of 5 mM NHS-LC-Biotin (MW 
556.58) for a 1 hour incubation at room temperature to form biotinylated CMlOl . 
Excess biotin is removed by dialysis against 1 liter of PBS at 4^C four times. The 
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product is purified by gel filtration on an Ultrahydrogel 1000 HPLC. lyophilized and 
stored at -70°C until use. 

Tissues are recovered 5 min post infusion with CM 101 and subjected to 
imfnunohistochemistry. Tumor and normal mouse tissue sections are analyzed for CM 
5 101 binding by both mouse anti-CMlOl mAb (7 A3), followed by secondary mAb- 
HRP conjugate (referred to in FIG. 4B as MLT CMIOI-Biot.5' + McAb), or with 
avidin (which specifically binds biotin) conjugated with HRP (referred to in FIG. 4A 
as MLT CMIOI-Biot.5' + Strep.HRP). 

FIGS. 4A-4C depict different sections taken from the same tumor and include 

1 0 a longitudinal view of the same blood vessel approximately in the center of the 

figures. The dark staining in FIG. 4A shows the localization of the biotin component 
in the cells lining the blood vessel. Similarly, FIG. 4B depicts the localization of the 
CM 101 component in the cells lining the blood vessel. FIG. 4C is a negative control 
that was not exposed to CMIOI. The analysis clearly shows that 7 A3 and avidin bind 

1 5 to the same blood vessels in tumor tissue. Thus, biotin has been delivered to the 

blood vessel of the tumor tissue by virtue of its physical association with a compound 
(CM 101) that binds the GBS toxin receptor. 

These studies show that chimeric compounds can be delivered to tissues 
undergoing pathologic and/or hypoxia-driven angiogenesis or neovascularization. As 

20 part of a chimeric compound, cytotoxic molecules can be directed to such tissues, e.g., 
tumor tissue. The cytotoxic molecule can be coupled directly to a molecule that binds 
GBS toxin receptor, e.g., GBS toxin. Alternatively, the molecule that binds GBS 
toxin receptor can be coupled to biotin and the cytotoxic molecule can be coupled to 
avidin. 
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Example 7 - Enhanced Sensitivity to GBS-Toxin-Dependent Cytotoxicity 
OF Cells Expressing GBS Toxin Receptor 

This example shows the enhanced sensitivity to GBS-toxin-dependent 
cytotoxicity of cells transfected with the GBS toxin receptor, relative to control cells. 
5 Without being bound to a particular theory, the inventors believe that complement 
binds GBS toxin bound to the GBS toxin receptor on a cell, thereby targeting the cell 
for killing by white blood cells (WBC). 

Human bladder carcinoma cells (ECV cells), are stable transfected with the 
human GBS toxin receptor gene. The resultant cell line is ECV71 1 . Cells stable 
10 transfected with vector alone as referred to as V23. ECV 71 1 and V23 are seeded in 
96- well plates at 5,000 cells/well. 

White blood cells are collected from healthy human donors as follows. Blood 
is collected by standard phlebotomy procedures into heparinized tubes (30 U/ml) and 
centrifiiged at 2000 rpm for 20 min. The interface is carefully transferred to a new 
15 tube and washed twice by centrifugation with medium (RPMI-1640). Cells are 
resuspended in RPMI-1640 supplemented with 5% fetal bovine serum (FBS) and 
Interferon-gamma (IFN) at 100 U/ml, and incubated overnight in a 37^C, 5%C02 
incubator. The cells are then resuspended in fresh medium with 5% FBS. 

5,000 cells of the WBC preparation are added to each well containing the 
20 transfected cells. CM 101 is added to a final concentration of l|ag/ml to the wells 

together with human serum from matching human donors. The cells are incubated 6 
hours at 37 C. 

Cytotoxicity is assayed by measuring lactate dehydrogenase (LDH) using the 
Promega's CytoTox 96 Non-Radioactive Assay kit (Nachlas et al. (I960) Anai 

25 Biochem 1,317; Korzeniewski et al. (1983) J, Immunol Methods 64, 313; Decker et 
al. 1 Immunol Methods 1 15, 61; Brander et al. (1993) Eur, J. Immunology 23, 3217; 
Behl et al. (1994) Cell 11, 817; Lappalainen et al. (1994) Pharm. Research 11,1 127; 
Allen et al. (1994) Promega Notes 45, 7; Sinensky et al. (1995) Toxicol Letters 75, 
02; Moravec (1994) Promega Notes 45, 11). Percent cytotoxicity is calculated as 

30 recommended by the manufacturer's instructions. The results are shown in Table 1 1 . 
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Table 11 



Cytotoxicity 


ECV711 


V23 


WBC, IFN, C3, 
-CMlOl 


29.1% 


27.5% 


WBC, IFN, C3, 
+CM101 


40.45% 


22.46% 



There is an increase in cytotoxicity of 39% when the ECV 711 cells are 
incubated with CMlOl, WBC and human serum (source of C3) compared to cells 
5 incubated without CMlOl . Control cells transfected with vector alone, V23, do not 
show a CMlOl dependent increase in cytotoxicity. 

All publications and patent applications mentioned in this specification are 
herein incorporated by reference to the same extent as if each individual publication 
or patent application was specifically and individually indicated to be incorporated by 
10 reference. 

The invention now being fully described, it will be apparent to one of ordinary 
skill in the art that many changes and modifications can be made thereto without 
departing from the spirit or scope of the appended claims. 
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1 . An isolated polynucleotide at least 10 bases in length comprising a nucleic 
acid sequence encoding, or complementary to a nucleic acid sequence encoding, a 
mammalian receptor for group B P-hemolytic Streptococcus toxin (GBS toxin 

5 receptor), or a polypeptide fragment thereof. 

2. The polynucleotide of Claim 1, wherein the nucleic acid sequence 
comprises SEQ ID NO: 9. 

3. The polynucleotide of Claim 1, wherein the nucleic acid sequence has 
100% identity to a nucleic acid sequence selected from the group consisting of 

10 residues 61 to 1542 of SEQ ID NO: 1, residues 266 to 1870 of SEQ ID N0:7, and 
residues 87 to 1568 of SEQ ID NO: 3. 

4. The polynucleotide of Claim 1 , wherein the polynucleotide is hybridizable 
under high stringency conditions to the nucleic acid sequence of SEQ ID NO: 7. 

5 . A vector comprising the polynucleotide of Claim 1 . 
15 6. A host cell transformed with the vector of Claim 5 . 

7. A process for producing a mammalian GBS toxin receptor or fragment 
thereof, comprising culturing the host cell of Claim 6 in a suitable culture medium. 

8. An isolated polypeptide comprising a mammalian GBS toxin receptor or 
fragment thereof 

20 9. The polypeptide of Claim 8, wherein the receptor has at least about 86% 

identity to the corresponding amino acid sequence of SEQ ID NO: 2. 

10. The polypeptide of Claim 8, wherein the receptor or fragment has 100% 
identity to the corresponding region of the amino acid sequence of SEQ ID NO: 4 or 
SEQ ID NO: 8. 

25 11. The polypeptide of Claim 8, wherein the polypeptide is encoded by a 

nucleic acid sequence hybridizable under high stringency conditions to a nucleic acid 
sequence selected from the group consisting of: 
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a) nucleotides 61 to 1542 of SEQ ID NO: 1, and 

b) nucleotides 87 to 1568 of SEQ ID NO: 3. 

12. An isolated polypeptide comprising an amino acid sequence that differs 
from an amino acid sequence selected from the group consisting of SEQ ID NO:2, 
5 SEQ ID N0:4 and SEQ ID NO: 8 at no more than about 20% of the amino acid 
residues. 



13. The isolated polypeptide of Claim 12, wherein the amino acid sequence of 
said isolated polypeptide differs from the amino acid sequence selected from said 
group by one amino acid residue. 

10 14. The isolated polypeptide of Claim 12, wherein the different amino acid 

residues are conservative substitutions of the corresponding residues of the amino 
acid sequence selected from said group. 

1 5. An isolated polypeptide comprising an amino acid sequence of the 
formula: 



15 AAl-AAn-AAm 

wherein: 

AAl is absent or is M; 

AAn is a contiguous chain of 0 to 100 amino acids, preferably of 0 or 41 
amino acids, even more preferably of residues 2-42 of SEQ ID N0:8; and 

20 AAm is a contiguous chain of 494 amino acids comprising AA43 through 

AA536, wherein: 

(1) each of AA43, AA47, AA51, AA52, AA57, AA58, AA65, AA66, 
AA72, AA85, AA87, AA93, AA94, AA96, AAl 15, AAl 16, AA122, AA123, 
AA125, AA134, AA143, AA173, AA174, AA178, AA185, AA186, AA189, AA190, 

25 AA196, AA200, AA204, AA206, AA207, AA220, AA253, AA260, AA276, AA277, 
AA280, AA283, AA287, AA294, AA295, AA298, AA300, AA301, AA312, AA324, 
AA326, AA360, AA365, AA373, AA374, AA379, AA396, AA403, AA407, AA418, 
AA480, AA483, AA486, AA491, AA494, AA502, AA528, AA529, AA532 and 
AA536 is an amino acid residue corresponding to: 

30 (a) residue 43, 47, 51, 52, 57, 58, 65, 66, 72, 85, 87, 93, 

94, 96, 115, 116, 122, 123, 125, 134, 143, 173, 174, 178, 185, 186, 189, 190, 196, 
200, 204, 206, 207, 220, 253, 260, 276, 277, 280, 283, 287, 294, 295, 298, 300, 301, 
312, 324, 326, 360, 365, 373, 374, 379, 396, 403, 407, 418, 480, 483, 486, 491, 494, 
502, 528, 529, 532 and 536, respectively, of SEQ ID N0:8; 

35 (b) residue 2, 6, 10, 11, 16, 17, 24, 25, 31, 44, 46, 52, 

53, 55, 74, 75, 81, 82, 84, 93, 102, 132, 133, 137, 144, 145, 148, 149, 155, 159, 163, 
165, 166, 179, 212, 219, 235, 236, 239, 242, 246, 253, 254, 257, 259, 260, 271, 283, 
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285, 319, 324, 332, 333, 338. 355. 362, 366, 377, 439, 442. 445, 450. 453, 461. 487, 
488, 491 and 495, respectively of SEQ ID N0:4; or 

(c) a conservative substitution thereof; 

(2) each of AA44-AA46, AA48-AA50, AA53-AA56. AA59-AA64, 
5 AA67-AA71, AA73-AA84, .A.A86, AA88-AA92, AA95, AA97-AA1 14, AA117- 

AA121, AA124, AA126-AA133, AA135-AA142, AA144-AA172, AA175-AA177, 
AA179-AA184, AA187-AA188. AA191-AA195, AA197-AA199. AA201-AA203, 
AA205, AA208-AA219, AA221-AA252, AA254-AA259. AA261-AA275, AA278- 
AA279, AA281-AA282, AA284-AA286, AA288-AA293, AA296-AA297, AA299, 
10 AA302-AA311, AA313-AA323, AA325, AA327-AA359, AA361-AA364, AA366- 
AA372, AA375-AA378, AA380-AA395, AA397-AA402, AA404-AA406, AA408- 
AA417, AA419-AA478, AA481-AA482, AA484-AA485, AA487-AA490, AA492- 
AA493. AA495-AA501, AA503-AA527, AA530-AA531 and AA533-AA535 is 

(a) residue 44-46, 48-50, 53-56, 59-64, 67-71, 73-84, 
15 86, 88-92,95,97-114, 117-121, 124, 126-133, 135-142, 144-172, 175-177, 179-184, 

187-188, 191-195, 197-199, 201-203, 205, 208-219, 221-252, 254-259, 261-275, 278- 
279, 281-282, 284-286, 288-293, 296-297, 299, 302-311, 313-323, 325, 327-359, 
361-364, 366-372, 375-378, 380-395, 397-402, 404-406, 408-417, 419-478, 481-482, 
484-485, 487-490, 492-493, 495-501, 503-527, 530-531 and 533-535, respectively, of 
20 SEQ ID N0:8; or 

(b) a conservative substitutions thereof; and 

(3) one or more of AA315 through AA367 are optionally absent. 

16. An antibody that recognizes a mammalian GBS toxin receptor or fragment 

thereof. 

25 17. An isolated complex comprising a GBS toxin bound to a mammalian GBS 

toxin receptor or fragment thereof. 

18. A method of forming a complex comprising: 

contacting a GBS toxin with a polypeptide comprising a mammalian 
GBS toxin receptor, or fragment thereof that can bind GBS toxin, under conditions 
30 that permit specific binding of the GBS toxin to the polypeptide, and 
allowing the complex to form. 

19. A method for purifying a compound that binds a GBS toxin receptor, 
which method comprises: 

providing a polypeptide comprising a mammalian GBS toxin receptor 
35 or fragment thereof that binds GBS toxin; 

contacting said polypeptide with a sample comprising the compound 
under conditions that allow specific binding of the compound to the polypeptide; and 
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separating the bound compound froni the remainder of the sample. 



20. A method of determining the presence or absence of GBS toxin in a 
sample, which method comprises: 

contacting the sample with a polypeptide comprising a mammalian 
5 GBS toxin receptor, or fragment thereof that can bind GBS toxin, under conditions 
that permit specific binding of the GBS toxin to the polypeptide, 

and determining whether specific binding has occurred. 

21 . A method for diagnosing early onset disease in a neonate comprising 
performing the method of Claim 20, wherein the sample is obtained from the neonate 

10 and wherein presence of the GBS toxin is indicative of early onset disease. 

22. A method for detecting pathologic vasculature in a mammalian tissue, 
which method comprises detecting the presence of a GBS toxin receptor. 

23. A method for identifying a compound which inhibits binding of a GBS 
toxin to a mammalian GBS toxin receptor, comprising: 

1 5 combining a test compound with a polypeptide comprising a 

mammalian GBS toxin recqjtor, or fragment thereof that can bind GBS toxin, in a 
reaction mixture containing GBS toxin and under conditions that permit specific 
binding of the GBS toxin to the receptor or fragment, and 

determining the amount of inhibition by the compound of the binding 

20 of the GBS toxin to the polypeptide. 

24. An inhibitor of binding of a GBS toxin to a mammalian GBS toxin 
receptor. 

25. A method for identifying a compound which specifically binds a 
mammalian GBS toxin receptor, comprising: 

25 combining a test compound with a polypeptide comprising a 

mammalian GBS toxin receptor or fi^gment thereof that can bind GBS toxin, under 
conditions that allow specific binding to occur, and 

detecting a complex formed between said test compound and said 
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26. A method for determining cytotoxicity of a test chimeric compound, which 
method comprises: 

exposing a cell expressing, on the cell surface, a mammalian GBS 
5 toxin receptor, or fragment thereof that binds GBS toxin, to a test chimeric compound 
comprising a cytotoxic agent coupled to said GBS toxin; and 
detecting signs of cytotoxicity. 

27. A chimeric compound comprising a cytotoxic agent covalently linked to a 
molecule that specifically binds a mammalian GBS toxin receptor. 

10 28. A method for identifying an inhibitor of GBS toxin receptor, which 

method comprises: 

incubating test cells in the presence and absence of a test compound 
and under conditions in which the cells incubated in the absence of the test compound 
can proliferate or migrate, wherein the test cells express GBS toxin receptor or a 
1 5 fragment thereof having GBS toxin receptor activity; and 

comparing the proliferation or migration of the test cells incubated in 
the presence of the test compound to the proliferation or migration of the test cells 
incubated in the absence of the test compotmd, wherein less proliferation or migration 
in the presence of the test compound is indicative of the test compound being an 
20 inhibitor of the GBS toxin receptor. 

29. A method for identifying an inhibitor of endothelial cell proliferation or 
migration, which method comprises: 

incubating test endothelial cells in the presence and absence of a test 
compound and under conditions in which the cells incubated in the absence of the test 
25 compound can proliferate or migrate, wherein the test cells express GBS toxin 
receptor or a fragment thereof having GBS toxin receptor activity; and 

comparing the proliferation or migration of the test cells incubated in 
the presence of the test compound to the proliferation or migration of the test cells 
incubated in the absence of the test compound, wherein less proliferation or migration 
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in the presence of the test compound is indicative of the test compound being an 
inhibitor of the endothelial cell proliferation or migration. 



30. A method for identifying a therapeutic compound for the treatment or 
prevention of a medical condition characterized by pathologic angiogenesis or 
5 neovascularization, which method comprises: 

incubating test cells in the presence and absence of a test compoimd, 
wherein the test cells express GBS toxin receptor or a fragment thereof having GBS 
toxin receptor activity; 

comparing the proliferation or migration of the test cells incubated in 
1 0 the presence of the test compoimd to the proliferation or migration of the test cells 

incubated in the absence of the test compound, wherein less proliferation or migration 
in the presence of the test compound is indicative of the test compound being a 
candidate therapeutic compound for the treatment or prevention of the medical 
condition. 

15 31. The method of Claim 30, wherein the medical condition is a cancerous 

tiunor. 

32. The method of Claim 30, wherein the medical condition is a reperfiision 

injury. 

33. The method of Claim 30, wherein the medical condition is scarring during 
20 woimd healing. 

34. The method of Claim 30, wherein the medical condition is keloids. 

35. The method of Claim 30, wherein the medical condition is a chronic 
inflammatory disease. 

36. The method of Claim 30, wherein the medical condition is neural injury. 

25 37. A method for identifying a compound which inhibits binding of a GBS 

toxin to a mammalian GBS toxin receptor, comprising: 

(a) simulating and selecting the most probable conformations of a 
mammalian GBS toxin receptor, 
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(b) designing a chemically modified analog that substantially mimics 
the energetically most probable three-dimensional structure of the polypeptide, 

(c) chemically synthesizing the analog, and 

(d) evaluating the bioactivity of the analog. 

5 38. A method for identifying a compound which binds to a mammalian GBS 

toxin receptor, comprising: 

(a) simulating and selecting the most probable conformations of a 
mammalian GBS toxin receptor, 

(b) deducing the most probable binding domains of the polypeptide, 
10 (c) designing a compound that would form the energetically most 

probable complexes with the polypeptide, 

(d) chemically synthesizing the compound, and 

(e) evaluating the bioactivity of the compound. 

39. A method for the prevention or treatment of neonatal onset disease in a 
15 human neonate, comprising administering an inhibitor of binding of GBS toxin to a 

human GBS toxin receptor. 

40. A method for inhibiting pathologic or hypoxia-driven endothelial cell 
proliferation or migration in a mammalian tissue, which method comprises 
specifically binding a molecule to a GBS toxin receptor present on the surface of at 

20 least one cell in the tissue, the molecule being selected from the group consisting of: 

a compound that can evoke an inflammatory response when bound to a 
GBS toxin receptor in a manmial; 

a chimeric compoimd comprising a cytotoxic compound coupled to a 
compoimd that specifically binds the GBS toxin receptor; 
25 an inhibitor of GBS toxin receptor phosphorylation; and 

an inhibitor of GBS toxin receptor activity. 

41. A pHannaceutical composition comprising a pharmaceutically effective 
amount of a molecule selected from the group consisting of: 

a GBS toxin Receptor or fragment thereof; 
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an inhibitor of a GBS toxin receptor; and 

a chimeric compound comprising a cytotoxic agent coupled to a 
compound that binds GBS toxin receptor, 
and- a pharmaceutically acceptable carrier. 



5 



42. A kit comprising a component selected from the group consisting of: 

a GBS toxin receptor or fragment; 

a reagent for detecting the presence of a GBS toxin receptor or 

fragment; and 

10 a reagent for detecting the presence of polynucleotide encoding the 

GBS toxin receptor or fragment. 

43. A molecule for use in a method of treatment of the human or animal body, 
said molecule being selected from the group consisting of: 

a GBS toxin receptor or fragment thereof for use in a method of 
1 5 treatment of the human or animal body, said molecule being selected from the group 
consisting of: 

a GBS toxin receptor or fragment thereof; 
an inhibitor of binding of GBS toxin to a GBS toxin receptor; 
an inhibitor of a GBS toxin receptor; and 
20 a chimeric compound comprising a cytototoxic agent coupled to a 

compound that binds GBS toxin receptor. 

44. Use of an inhibitor of a GBS toxin receptor, or of an inhibitor of binding of 
GBS toxin to a GBS toxin receptor, for the manufacture of a medicament for the 
treatment of a medical condition characterized by pathologic or hypoxia-driven 

25 angiogenesis or neovascularization. 
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HUMAN OVARY 
CANCER + Pab 55 

FIG.^2A 





NORMAL HUMAN 
OVARY + Pab 55 

FIG..2B 
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HUMAN OVARY 
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NORMAL HUMAN 
OVARY + Pab 57 

FIG..3B 

SUBSTITUTE SHEET (RULE 26) 
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MLT CM101 - Biot.5'+ Strep.HRP 



FIG..4A 
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MLT - PBS 5* + Streptavidin - HRP 



FIG..4C 
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sequench: listing 

<:X10> Hellerqvist, Carl 
Fu, Changlin 

<120> GBS Toxin Receptor 

<130> CARB-008/01WO 

<140> 
<141> 

<150> 60-093 , 643 
<151> i998-07-22 

<160> 12 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 2602 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (58) . . (1542) 
<400> 1 

tcgggccggc gctcccttct ctgccaggtg gcgagtacac ctgctcacgt aggcgtc 57 

atg agg tct ccg gtt cga gac ctg gcc egg aac gat ggc gag gag age 105 
Met Arg Ser Pro Val Arg Asp Leu Ala Arg Asn Asp Gly Glu Glu Ser 
15 XO 15 

acg gac cgc acg cct ctt eta ceg ggc gcc cea egg gcc gaa gcc get 153 
Thr Asp Arg Thr Pro Leu Leu Pro Gly Ala Pro Arg Ala Glu Ala Ala 
20 25 30 

cca gtg tgc tgc tct get cgt tac aac tta gea att ttg gee ttt ttt 201 
Pro Val Cys Cys Ser Ala Arg Tyr Asn Leu Ala lie Leu Ala Phe Phe 
35 40 45 

ggt ttc ttc att gtg tat gea tta cgt gtg aat ctg agt gtt gcg tta 249 
Gly Phe Phe He Val Tyr Ala Leu Arg Val Asn Leu Ser Val Ala Leu 
50 55 60 

gtg gat atg gta gat tea aat aca act tta gaa gat aat aga act tec 297 
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Val Asp Met Val Asp Ser Asn Thr Thr Leu Giu Asp Asn Arg Thr Ser 
65 70 75 80 

aag gcg tgt cca gag cat tct get ccc ata aaa get cat cat aat caa 345 
Lys Ala Cys Pro Glu His Ser Ala Pro He Lys Val His His Asn Gin 
85 90 95 

acg ggt aag aag tac caa tgg gat gca gaa act caa gga tgg att etc 393 
Thr Gly Lys Lys Tyr Gin Trp Asp Ala Glu Thr Gin Gly Trp He Leu 
100 105 110 

ggt tee ttt ttt tat ggc tac ate ate aca cag att cct gga gga tat 441 
Gly Ser Phe Phe Tyr Gly Tyr He He Thr Gin He Pro Gly Gly Tyr 
115 120 125 

gtt gee age aaa ata ggg ggg aaa atg etg eta gga ttt ggg ate ctt 489 
Val Ala Ser Lys He Gly Gly Lys Met Leu Leu Gly Phe Gly He Leu 
130 135 140 

ggc act get gte etc ace ctg ttc act eee att get gca gat tta gga 537 
Gly Thr Ala Val Leu Thr Leu Phe Thr Pro He Ala Ala Asp Leu Gly 
145 150 155 160 

gtt gga cca etc att gta etc aga gca eta gaa gga eta gga gag ggt 585 
Val Gly Pro Leu He Val Leu Arg Ala Leu Glu Gly Leu Gly Glu Gly 
165 170 175 

gtt aca ttt cca gee atg cat gee atg tgg tct tct tgg get cce cct 633 
Val Thr Phe Pro Ala Met His Ala Met Trp Ser Ser Trp Ala Pro Pro 
180 185 190 

ctt gaa aga age aaa ctt ctt age att teg tat gca gga gca cag ctt 681 
Leu Glu Arg Ser Lys Leu Leu Ser He Ser Tyr Ala Gly Ala Gin Leu 
195 200 205 

ggg aca gta att tct ctt cct ctt tct gga ata att tge tac tat atg 729 
Gly Thr Val He Ser l»eu Pro Leu Ser Gly He He Cys Tyr Tyr Met 
210 215 220 

aat tgg act tat gtc ttc tac ttt ttt ggt act att gga ata ttt tgg 777 
Asn Trp Thr Tyr Val Phe Tyr Phe Phe Gly Thr He Gly He Phe Trp 
225 230 235 240 

ttt Ctt ttg tgg ate tgg tta gtt agt gac aca cca caa aaa cac aag 825 
Phe Leu Leu Trp He Trp Leu Val Ser Asp Thr Pro Gin Lys His Lys 
245 250 255 

aga att tee cat tat gaa aag gaa tac att ctt tea tea tta aga aat 873 
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Arg lie Ser His Tyr Glu Lys Glu Tyr lie Leu Ser Ser Leu Arg Asn 
260 265 270 

cag cct tct tea cag aag Lca gtg ccg tgg gca ccc act tta aaa tec 921 
Gin Leu Ser Ser Gin Lys Ser Val Pro Trp Val Pro He Leu Lys Ser 
275 280 285 

etg eca ett tgg get ate gta gtt gca cac ttt tct tac aac tgg act 969 
Leu Pro Leu Trp Ala He Val Val Ala His Phe Ser Tyr Asn Trp Thr 
290 295 300 

ttt tat act tta ttg aca tta ttg cct act tat atg aag gag ate eta 1017 
Phe Tyr Thr Leu Leu Thr Leu Leu Pro Thr Tyr Met Lys Glu He Leu 
305 310 315 320 

agg ttc aat gtt caa gag aat ggg ttt tta tct tea ttg cct tat tta 1065 
Arg Phe Asn Val Gin Glu Asn Gly Phe Leu Ser Ser Leu Pro Tyr Leu 
325 330 335 

ggc tct tgg tta tgt atg ate etg tct ggt eaa get get gac aat tta 1113 
Gly Ser Trp Leu Cys Met He Leu Ser Gly Gin Ala Ala Asp Asn Leu 
340 345 350 

agg gca aaa tgg aat ttt tea act tta tgt gtt egc aga att ttt age 1161 
Arg Ala Lys Trp Asn Phe Ser Thr Leu Cys Val Arg Arg He Phe Ser 
355 360 365 

ett ata gga atg att gga cct gca gta ttc etg gta get get ggc ttc 1209 
Leu He Gly Met He Gly Pro Ala Val Phe Leu Val Ala Ala Gly Phe 
370 375 380 

att ggc tgt gat tat tct ttg gcc gtt get ttc eta act ata tea aca 1257 
He Gly Cys Asp Tyr Ser Leu Ala Val Ala Phe Leu Thr He Ser Thr 
365 390 395 400 

aca etg gga ggc ttt tgc tct tct gga ttt age ate aac cat etg gat 1305 
Thr lieu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Asp 
405 410 415 

att get cct teg tat get ggt ate etc etg ggc ate aca aat aca ttt 1353 
He Ala Pro Ser Tyr Ala Gly He Leu Leu Gly He Thr Asn Thr Phe 
420 425 430 

gee act att cca gga atg gtt ggg ccc gte att get aaa agt etg acc 1401 
Ala Thr He Pro Gly Met Val Gly Pro Val He Ala Lys Ser Leu Thr 
435 440 445 

cet gat aac act gtt gga gaa tgg eaa ace gtg ttc tat att get get 1449 
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Pro Asp Asn Thr Val Gly Giu Trp Gin Thr Val Phe Tyr He Ala Ala 
450 455 460 

gcc att aat gtc zzt ggc gcc act ttc tct aca eta ttc gcc aaa ggt 1497 
Ala He Asn Val Phe Gly Ala He Phe Phe Thr Leu Phe Ala Lys Gly 
465 470 475 480 

gaa gta caa aac egg get etc aat gat eae eat gga cae aga cae 1542 
Glu Val Gin Asn Trp Ala Leu Asn Asp His His Gly His Arg His 
485 490 495 

tgaaggaacc aataaataat cetgcetcta ttaatgtatt tttatttatc atgtaacctc 1602 

aaagtgcctt ctgtattgtg taagcattct atgtcttttt ttaattgtac ttgtattaga 1662 

tttttaaggc ctataatcat gaaatatcac tagttgccag aataataaaa tgaactgtgt 1722 

ttaattatga ataatatgta agctaggact tetactttag gtteacatac ctgcctgcta 1782 

gtcgggcaac acgaagtagg acagttctgt tgatttttta gggccatact aaagggaatg 1842 

agctgaaaca gacctcctga tacctttgct taattaaact agatgataat tctcaggtae 1902 

tgataaacac ctgttgttgt tcactttcct cataaaaatt gtcagctctc tctgacactt 1962 

agacctcaaa ctttagcatc tctgtggagc tgccatccac tgtataattt cgcctggcaa 2022 

ctggactgag gggagtgtgc ccaggcagct gccaagcact ccctccctgg cttcagggtc 2082 

agagtgccca gcgtttatca gaggcagcat ccaagcccag agccagtgtc gactcttcgg 2X42 

ctggtgcctt tcctctgagg ggctatcaat gtgtagataa agccctgagt aggcaagagc 2202 

agtgagatcc actgctatgg tcttgataca tcctcaaact ttcccttccc agcacagagg 2262 

aatattggct ggcatgcaac ctgcaaaaga aaaatgcgaa gcggccgggc acggtggctc 2322 

atgcctgtaa tcccagcact ttggggggct gaggtgggcg aatcatgaga tcaggagttc 2382 

gagaccagcc tggccagcat ggtgaaaccc catctctact aaaaatacaa aaaactagct 2442 

gggcgtggtg acgggcgcct gtaatcccag atactcagga ggctgaggta ggagaatcac 2502 

ttgaacccgg gaggtggaag ttgcagtgaa ccaagatcac gccactgcac tccagcctgg 2562 

gcgatggagc gagactccaa ctcaaaaaaa aaaaaaaaaa 2602 
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<210> 2 
<211> 495 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Arg Ser Pro Val Arg Asp Leu Ala Arg Asn Asp Gly Glu Glu Ser 
15 10 15 

Thr Asp Arg Thr Pro Leu Leu Pro Gly Ala Pro Arg Ala Glu Ala Ala 
20 25 30 

Pro Val Cys Cys Ser Ala Arg Tyr Asn Leu Ala lie Leu Ala Phe Phe 
35 40 45 

Gly Phe Phe lie Val Tyr Ala Leu Arg Val Asn Leu Ser Val Ala Leu 
50 55 60 

Val Asp Met Val Asp Ser Asn Thr Thr Leu Glu Asp Asn Arg Thr Ser 
65 70 75 80 

Lys Ala Cys Pro Glu His Ser Ala Pro lie Lys Val His His Asn Gin 
85 90 95 

Thr Gly Lys Lys Tyr Gin Trp Asp Ala Glu Thr Gin Gly Trp lie Leu 
100 105 110 

Gly Ser Phe Phe Tyr Gly Tyr He He Thr Gin He Pro Gly Gly Tyr 
115 120 125 

Val Ala Ser Lys He Gly Gly Lys Met Leu Leu Gly Phe Gly He Leu 
130 135 140 

Gly Thr Ala Val Leu Thr Leu Phe Thr Pro He Ala Ala Asp Leu Gly 
145 150 155 160 

Val Gly Pro Leu He Val Leu Arg Ala Leu Glu Gly Leu Gly Glu Gly 
165 170 175 

Val Thr Phe Pro Ala Met His Ala Met Trp Ser Ser Trp Ala Pro Pro 
180 185 190 

Leu Glu Arg Ser Lys Leu Leu Ser He Ser Tyr Ala Gly Ala Gin Leu 
195 200 205 

Gly Thr Val He Ser Leu Pro Leu Ser Gly He He Cys Tyr Tyr Met 
210 215 220 
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Asn Trp Thr Tyr Val Phe T\'r Phe Phe Gly Thr lie Gly lie Phe Trp 
225 230 235 240 

Phe Leu Leu Trp lie Trp Leu Vai Ser Asp Thr Pro Gin Lys His Lys 
245 250 255 

Arg lie Ser His Tyr Glu Lys Glu Tyr lie Leu Ser Ser Leu Arg Asn 
260 265 270 

Gin Leu Ser Ser Gin Lys Ser Val Pro Trp Val Pro lie Leu Lys Ser 
275 280 285 

Leu Pro Leu Trp Ala lie Val Val Ala His Phe Ser Tyr Asn Trp Thr 
290 295 300 

Phe Tyr Thr Leu Leu Thr Leu Leu Pro Thr Tyr Met Lys Glu lie Leu 
305 310 315 320 

Arg Phe Asn Val Gin Glu Asn Gly Phe Leu Ser Ser Leu Pro Tyr Leu 
325 330 335 

Gly Ser Trp Leu Cys Met He Leu Ser Gly Gin Ala Ala Asp Asn Leu 
340 345 350 

Arg Ala Lys Trp Asn Phe Ser Thr I^u Cys Val Arg Arg He Phe Ser 
355 360 365 

Leu He Gly Met He Gly Pro Ala Val Phe Leu Val Ala Ala Gly Phe 
370 375 380 

He Gly Cys Asp Tyr Ser Leu Ala Val Ala Phe Leu Thr He Ser Thr 
385 390 395 400 

Thr Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Asp 
405 410 415 

He Ala Pro Ser Tyr Ala Gly He Leu Leu Gly He Thr Asn Thr Phe 
420 425 430 

Ala Thr He Pro Gly Met Val Gly Pro Val He Ala Lys Ser Leu Thr 
435 440 445 

Pro Asp Asn Thr Val Gly Glu Trp Gin Thr Val Phe Tyr He Ala Ala 
450 455 460 

Ala He Asn Val Phe Gly Ala He Phe Phe Thr Leu Phe Ala Lys Gly 
465 470 475 480 
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Glu Val Gin Asn Trp Ala Leu Asn Asp His His Giy His Arg His 
485 490 



<210>"3 
<211> 2844 
<212> DNA 
<213> Ovis sp. 



<220> 
<221> CDS 

<222> (84) , . (1568) 



<400> 3 

cccgggggcg gggggcttcg gcggtcccgc tggagctctc ttttccgcgg agcaggtttg 60 

cgccgtagct ccctgaaggc ate atg aag tec ccg gtt teg gae tta gee ceg 113 

Met Lys Ser Pro Val Ser Asp Leu Ala Pro 
1 5 10 

age gae gge gag gag gge teg gae ege aca cbg ete etg cag egc gee 161 
Ser Asp Gly Glu Glu Gly Ser Asp Arg Thr Pro Leu Leu Gin Arg Ala 
15 20 25 

ceg egg geg gaa eee get cca gta tge tge tet get egt tac aac eta 209 
Pro Arg Ala Glu Pro Ala Pro Val Cys Cys Ser Ala Arg Tyr Asn Leu 
30 35 40 



gea ttt ttg tee ttt ttt ggt tte tte gtt ete tat tea tta egg gtg 257 
Ala Phe Leu Ser Phe Phe Gly Phe Phe Val Leu Tyr Ser Leu Arg Val 
45 50 55 



aat etg age gtt gca eta gtg gae atg gtg gat tea aac aca act gee 305 
Asn Leu Ser Val Ala Leu Val Asp Met Val Asp Ser Asn Thr Thr Ala 
60 65 70 

aaa gat aat aga acg tec tac gag tgt gca gag cat tet get ccc ata 353 
Lys Asp Asn Arg Thr Ser Tyr Glu Cys Ala Glu His Ser Ala Pro lie 
75 80 85 90 

aaa gtt ctt cac aac caa acg ggt aaa aag tac egg tgg gat gca gaa 401 
Lys Val Leu His Asn Gin Thr Gly Lys Lys Tyr Arg Trp Asp Ala Glu 
95 . 100 105 

act caa gga tgg att etc gga tet ttt tte tat gge tac ate ate aca 449 
Thr Gin Gly Trp lie Leu Gly Ser Phe Phe Tyr Gly Tyr lie He Thr 
110 115 120 
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caa azc czz gga gga cat gtz gcc age aga age ggg ggg aag ctg etc 497 
Gin lie Pro Gly Gly Tyr Val Ala Ser Arg Ser Gly Gly Lys Leu Leu 
125 130 135 

cca gga ttc ggg acc ttt get aca get ate ttc ace ctg tte aet ccc 545 
Leu Gly Phe Gly lie Phe Ala Thr Ala lie Phe Thr Leu Phe Thr Pro 
140 145 150 

ete get gca gat tte gga gtc gga gee ctt gtt gca etc agg gca eta 593 
Leu Ala Ala Asp Phe Gly Val Gly Ala Leu Val Ala Leu Arg Ala Leu 
155 160 165 170 

gaa ggg eta gga gag ggt gtc aca tat cca gcc atg cat gcc atg tgg 641 
Glu Gly Leu Gly Glu Gly Val Thr Tyr Pro Ala Met His Ala Met Trp 
175 180 185 

tct tea tgg get ccc cct ctt gaa aga age aag ctt ctg agt att tea 689 
Ser Ser Trp Ala Pro Pro Leu Glu Arg Ser Lys Leu Leu Ser lie Ser 
190 195 200 

tat gca gga gca caa ctt ggg aca gta gtt tct ctt cct ctt tct gga 737 
Tyr Ala Gly Ala Gin Leu Gly Thr Val Val Ser Leu Pro Leu Ser Gly 
205 210 2X5 

gta att tgc tac tat atg aat tgg act tat gtc ttc tat ttc ttt ggc 785 
Val lie Cys Tyr Tyr Met Asn Trp Thr Tyr Val Phe Tyr Phe Phe Gly 
220 225 230 

att gtt gga ate ate tgg ttt att tta tgg ate tgc tta gtt agt gat 833 
He Val Gly He He Trp Phe He Leu Trp He Cys Leu Val Ser Asp 
235 240 245 250 

aca cca gaa act cac aag aca ate act ccc tat gaa aag gag tat att 881 
Thr Pro Glu Thr His Lys Thr He Thr Pro Tyr Glu Lys Glu Tyr He 
255 260 265 

ctt tea tea tta aaa aat cag etc tct tea cag aag tea gtg ceg tgg 929 
Leu Ser Ser Leu Lys Asn Gin Leu Ser Ser Gin Lys Ser Val Pro Trp 
270 275 280 

ata cct atg ctg aaa tea ctg cca ctt tgg get att gtc gtt gca cat 977 
He Pro Met Leu Lys Ser Leu Pro Leu Trp Ala He Val Val Ala His 
285 290 295 

ttt tct tac aac tgg act ttt tat aet ttg ttg acc tta ttg cct act 1025 
Phe Ser Tyr Asn Trp Thr Phe Tyr Thr Leu Leu Thr Leu Leu Pro Thr 
300 305 310 
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tac atg aag gaa gcc eta acg etc aac art caa gag aat ggg ttt tea 1073 
Tyr Met Lys Glu Val Leu Arg Phe Asn lie Gin Glu Asn Gly Phe Leu 
315 320 325 330 

tct gca gtc cct tat tta ggt tgt egg tta tgt atg ate ctg teg ggt 1121 
Ser Ala Val Pro Tyr Leu Gly Cys Trp Leu Cys Met lie Leu Ser Gly 
335 340 345 

eaa get get gac aat tta agg gea aga tgg aat ttt tea act etg egg 1169 
Gin Ala Ala Asp Asn Leu Arg Ala Arg Trp Asn Phe Ser Thr Leu Trp 
350 355 360 

gtt cga aga gtt ttt age ett ata ggg atg att gga cct gcg ata ttc 1217 
Val Arg Arg Val Phe Ser Leu lie Gly Met lie Gly Pro Ala He Phe 
365 370 375 

ctg gtt gcc gca gga ttt ata ggc tgt gat tat tec ttg get gtt gca 1265 
Leu Val Ala Ala Gly Phe He Gly Cys Asp Tyr Ser Leu Ala Val Ala 
380 385 390 

ttc eta ace ata tea aca ace ctg gga ggc ctt tge tct tct gga ttt 1313 
Phe Leu Thr He Ser Thr Thr Leu Gly Gly Phe Cys Ser Ser Gly Phe 
395 400 405 410 

age ate aac cat ctg gac att get cct teg tat get ggt att etc ctg 1361 
Ser He Asn His Leu Asp He Ala Pro Ser Tyr Ala Gly He Leu Leu 
415 420 425 

ggc ate aca aat acc ttt gee act att cct gga atg att ggg cec ate 1409 
Gly He Thr Asn Thr Phe Ala Thr He Pro Gly Met He Gly Pro He 
430 435 440 

att gcc aga agt ctt acc cct gag aac act att gga gaa tgg caa act 1457 
He Ala Arg Ser Leu Thr Pro Glu Asn Thr He Gly Glu Trp Gin Thr 
445 450 455 

gtt ttc tgc ate get get get ate aat gta ttt ggt gcc att ttc ttc 1505 
Val Phe Cys He Ala Ala Ala He Asn Val Phe Gly Ala He Phe Phe 
460 465 470 

aca eta ttc gcc aaa ggt gaa gtg caa aac tgg gcc ate agt gat cac 1553 
Thr Leu Phe Ala Lys Gly Glu Val Gin Asn Trp Ala He Ser Asp His 
475 480 485 490 

caa gga cac aga aac tgaaggaacc aataaataat cctgtctcta ttaatgtatc 1608 
Gin Gly His Arg Asn 
495 
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cccgcrcatc acgtaaccta aaagcgccrr tgacatttta atgtgcaagc aatctacaca 1668 
caagacaaaa ccgtactaga aaaatcgcgt cagatttgta aggcttgtaa tcatgaaacg 1728 
tcaccagttg ccatataagc aaaattagcc acttttaatt attattaacc cgtccgccgg 1788 
aacttacaac ccagggtcac atacctggcr gcaagtcagg caacccacaa taggggagtt 1848 
ctatttattt ataagaccat acctaaagag atgagctgaa atagaccctt ctatacctct 1908 
gctcaattaa ggcggataac aattctcagg tcttgttaaa catctgtttt tgcacacctt 1968 
cctcaaaaaa ttatttgtca tcagcaatcc ctgacatgta ggtctcaaac tttagcctct 2028 
ccacggagct ggcagccact gtatcattca gcctggcaac ttcactgagg gaagcatgcc 2088 
caggcagctg ccacatgtcc cctctctggc ttcagggaca gtgcccagca cttaggcagc 214 8 
atccaagacc agggtcagcg ccaaggcttt ggacggtatt cttcccctgg ggctgttaat 2208 
gtgtggatga agccctgagc caacagggac agcgcgatcc acagtcatgg tttccatgca 2268 
ccctctccct tcccttccca gcacactgga gtattgcctg gcatgtaacc tgcaaaagaa 2328 
agtgtgatgc ctaattagcc acatataaca tcatccttga tgatcctacc ttcacatgga 2388 
tcagagtata aatcttcaag tcctgtgttc taggagctac accagaataa ttaaaatata 2448 
aaaagaaaca aaacattttt tctgtctgac acctaagtgt ctggttgcag ttcaaggtta 2508 
aagtgacttc tacttcacat aacctgcaac cggtggtgta atcatcttta gtgctggttc 2568 
cttaaatctt atttttccag tttttcctgg accatcttcc agtggttttg agcatgcttt 2628 
gagggcattt atgtgattta gaacttgatt aatgtttcac tgtgtatgtt caacactacc 2686 
tgtaatattt taactaaagc tatttaatgt aatatgatgt gtatacattc tgtaaattaa 2748 
tttttaaatc tgtaaatagc tttaagttgc tatggtgata tttcttttac aaatcaaaat 2808 
aaatcttttt ggaacgataa aaaaaaaaaa aaaaaa 2844 



<210> 4 
<211> 495 
<212> PRT 
<213> Ovis sp. 
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<400> 4 

Met Lys Ser Pro 
1 

Ser Asp Arg Thr 
20 

Pro Val Cys Cys 
35 

Gly Phe Phe Val 
50 

Val Asp Met Val 
65 

Tyr Glu Cys Ala 



Thr Gly Lys Lys 
100 

Gly Ser Phe Phe 
115 

Val Ala Ser Arg 
130 

Ala Thr Ala He 
145 

Val Gly Ala Leu 



Val Thr Tyr Pro 
180 

Leu Glu Arg Ser 
195 

Gly Thr Val Val 
210 

Asn Trp Thr Tyr 
225 

Phe He Leu Trp 



Val Ser Asp Leu 
5 

Pro Leu Leu Gin 



Ser Ala Arg Tyr 
40 

Leu Tyr Ser Leu 
55 

Asp Ser Asn Thr 
70 

Glu His Ser Ala 
85 

Tyr Arg Trp Asp 



Tyr Gly Tyr He 
120 

Ser Gly Gly Lys 
135 

Phe Thr Leu Phe 
150 

Val Ala Leu Arg 
165 

Ala Met His Ala 



Lys Leu Leu Ser 
200 

Ser Leu Pro Leu 
215 

Val Phe Tyr Phe 
230 

He Cys Leu Val 
245 

11 



Ala Pro Ser Asp 
10 

Arg Ala Pro Arg 
25 

Asn Leu Ala Phe 



Arg Val Asn Leu 
60 

Thr Ala Lys Asp 
75 

Pro He Lys Val 
90 

Ala Glu Thr Gin 
105 

He Thr Gin He 



Leu Leu Leu Gly 
140 

Thr Pro Leu Ala 
155 

Ala Leu Glu Gly 
170 

Met Trp Ser Ser 
185 

He Ser Tyr Ala 



Ser Gly Val He 
220 

Phe Gly He Val 
235 

Ser Asp Thr Pro 
250 



Gly Glu Glu Gly 
15 

Ala Glu Pro Ala 
30 

Leu Ser Phe Phe 
45 

Ser Val Ala Leu 



Asn Arg Thr Ser 
80 

Leu His Asn Gin 
95 

Gly Trp He Leu 
HO 

Pro Gly Gly Tyr 
125 

Phe Gly He Phe 



Ala Asp Phe Gly 
160 

Leu Gly Glu Gly 
175 

Trp Ala Pro Pro 
190 

Gly Ala Gin Leu 
205 

Cys Tyr Tyr Met 



Gly He He Trp 
240 

Glu Thr His Lys 
255 
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Thr lie Thr Pro Tyr Glu Lys Giu Tyr lie Leu Ser Ser Leu Lys Asn 
260 265 270 

Gin Leu Ser Ser Gin Lys Ser Val Pro Trp He Pro Met Leu Lys Ser 
275 280 285 

Leu Pro Leu Trp Ala He Val Val Ala His Phe Ser Tyr Asn Trp Thr 
290 295 300 

Phe Tyr Thr Leu Leu Thr Leu Leu Pro Thr Tyr Met Lys Glu Val Leu 
305 310 315 320 

Arg Phe Asn He Gin Glu Asn Gly Phe Leu Ser Ala Val Pro Tyr Leu 
325 330 335 

Gly Cys Trp Leu Cys Met He Leu Ser Gly Gin Ala Ala Asp Asn Leu 
340 345 350 

Arg Ala Arg Trp Asn Phe Ser Thr Leu Trp Val Arg Arg Val Phe Ser 
355 360 365 

Leu He Gly Met He Gly Pro Ala He Phe Leu Val Ala Ala Gly Phe 
370 375 380 

He Gly Cys Asp Tyr Ser Leu Ala Val Ala Phe Leu Thr He Ser Thr 
385 390 395 400 

Thr Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Asp 
405 410 415 

He Ala Pro Ser Tyr Ala Gly He Leu Leu Gly He Thr Asn Thr Phe 
420 425 430 

Ala Thr He Pro Gly Met He Gly Pro He He Ala Arg Ser Leu Thr 
435 440 445 

Pro Glu Asn Thr He Gly Glu Trp Gin Thr Val Phe Cys He Ala Ala 
450 455 460 

Ala He Asn Val Phe Gly Ala He Phe Phe Thr Leu Phe Ala Lys Gly 
465 470 475 480 

Glu Val Gin Asn Trp Ala He Ser Asp His Gin Gly His Arg Asn 
485 490 495 



<210> 5 

12 
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<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 

<400> 5 

cgggatcccg ccngcnatgc ayrshrtstg g 

<210> 6 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 6 

ggaattccdg gdgcratktc narrtrrtt 



<210> 7 

<211> 2930 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (263) . , (1870) 
<400> 7 

gttcggtcga agccctcccc ttaattatgt gcaattcaag tccccactgc ccgcccgcaa 60 

gcccccactc atcctcgctg cgggcagggt ggcccctgca ctttacaagg gggtgcagga 120 

gcgggagacg gtcgtccgaa caccggctcc ccggcatgcg tagaccggcg ggcggagcgg 180 

gctcactttg cgccaatcct acgagaactc ccagaactcc gcttccctag tccaacccaa 240 

gccagagttg cccacaccta ag atg gcg gcg ggg gcg atg aca ccg ccc cgc 292 

Met Ala Ala Gly Ala Met Thr Pro Pro Arg 
15 10 

ccg gtc cag cca get egg cce ggg gge ttc ggg ctg teg ggc egg cgc 340 
Pro Val Gin Pro Ala Arg Pro Gly Gly Phe Gly Leu Ser Gly Arg Arg 
15 20 25 

13 
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tec ccc etc cgc cag gcg gcg age aca cct gcc cac gca ggc gtc acg 388 
Ser Leu Leu Cys Gin Val Ala Ser Thr Pro Ala His Val Gly Val Met 
• 30 35 40 

agg tct ccg gtt cga gac ctg gcc egg aac gat ggc gag gag age acg 436 
Arg Ser Pro Val Arg Asp Leu Ala Arg Asn Asp Gly Glu Glu Ser Thr 
45 50 55 



gac cgc acg cct ett eta ccg ggc gcc cea egg gcc gaa gee get cca 
Asp Arg Thr Pro Leu Leu Pro Gly Ala Pro Arg Ala Glu Ala Ala Pro 
60 65 70 



gat atg gta gat tea aat aca act tta gaa gat aat aga act tec aag 
Asp Met Val Asp Ser Asn Thr Thr Leu Glu Asp Asn Arg Thr Ser Lys 
110 115 120 



484 



gtg tgc tgc tct get cgt tac aac tta gca att ttg gcc ttt ttt ggt 532 
Val Cys Cys Ser Ala Arg Tyr Asn Leu Ala lie Leu Ala Phe Phe Gly 
75 80 85 90 

ttc ttc att gtg tat gea tta cgt gtg aat ctg agt gtt gcg tta gtg 580 
Phe Phe He Val Tyr Ala Leu Arg Val Asn Leu Ser Val Ala Leu Val 
95 100 105 



628 



gcg tgt cca gag cat tct get ccc ata aaa gtt cat cat aat caa acg 676 
Ala Cys Pro Glu His Ser Ala Pro He Lys Val His His Asn Gin Thr 
125 130 135 

ggt aag aag tac caa tgg gat gca gaa act caa gga tgg att etc ggt 724 
Gly Lys Lys Tyr Gin Trp Asp Ala Glu Thr Gin Gly Trp lie Leu Gly 
140 145 150 

tec ttt ttt tat ggc tac ate ate aca cag att cct gga gga tat gtt 772 
Ser Phe Phe Tyr Gly Tyr He He Thr Gin He Pro Gly Gly Tyr Val 
ISS 160 165 170 

gcc age aaa ata ggg ggg aaa atg ctg eta gga ttt ggg ate ctt ggc 820 
Ala Ser Lys He Gly Gly Lys Met Leu Leu Gly Phe Gly He Leu Gly 
175 180 185 

act get gtc etc ace ctg ttc act ccc att get gea gat tta gga gtt 868 
Thr Ala Val Leu Thr Leu Phe Thr Pro He Ala Ala Asp Leu Gly Val 
190 195 200 

gga cca etc att gta etc aga gea eta gaa gga eta gga gag ggt gtt 916 
Gly Pro lieu He Val Leu Arg Ala Leu Glu Gly Leu Gly Glu Gly Val 
205 210 215 
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aca tzz cca gcc atg cat gcc acg tgg ccc tct egg get ccc cct ctt 964 

Thr Phe Pro Ala Met His Ala Met Trp Ser Ser Trp Ala Pro Pro Leu 

220 225 230 

gaa aga age aaa ett ctt age att teg tat gca gga gea cag ctt ggg 1012 

Glu Arg Ser Lys Leu Leu Ser lie Ser Tyr Ala Gly Ala Gin Leu Gly 
235 240 245 250 

aca gta att tct ctt cct ctt tct gga ata att tgc tac tat atg aat 1060 

Thr Val lie Ser Leu Pro Leu Ser Gly He He Cys Tyr Tyr Met Asn 
255 260 265 



tgg act tat gtc ttc tac ttt ttt ggt act att gga ata ttt tgg ttt 1108 
Trp Thr Tyr Val Phe Tyr Phe Phe Gly Thr He Gly He Phe Trp Phe 
270 275 280 



ctt ttg tgg ate tgg tta gtt agt gac aca cca caa aaa cac aag aga 1156 
Leu Leu Trp He Trp Leu Val Ser Asp Thr Pro Gin Lys His Lys Arg 
285 290 295 

att tec cat tat gaa aag gaa tac att ctt tea tea tta aga aat cag 1204 
He Ser His Tyr Glu Lys Glu Tyr He Leu Ser Ser Leu Arg Asn Gin 
300 305 310 



ctt tct tea cag aag tea gtg ccg tgg 
Leu Ser Ser Gin Lys Ser Val Pro Trp 
315 320 

cea ctt tgg get ate gta gtt gca cac 
Pro Leu Trp Ala He Val Val Ala His 
335 

tat act tta ttg aca tta ttg cct act 
Tyr Thr Leu Leu Thr Leu Leu Pro Thr 
350 355 



gta ccc att tta aaa tec ctg 1252 

Val Pro He Leu Lys Ser Leu 
325 330 

ttt tct tac aac tgg act ttt 1300 

Phe Ser Tyr Asn Trp Thr Phe 

340 345 

tat atg aag gag ate eta agg 1348 

Tyr Met Lys Glu He Leu Arg 
360 



ttc aat gtt caa gag aat ggg ttt tta tct tea ttg cct tat tta gge 1396 
Phe Asn Val Gin Glu Asn Gly Phe Leu Ser Ser Leu Pro Tyr Leu Gly 
365 370 375 



tet tgg tta tgt atg ate ctg tct ggt caa get get gac aat tta agg 1444 

Ser Trp Leu Cys Met He Leu Ser Gly Gin Ala Ala Asp Asn Leu Arg 

380 385 390 

gea aaa tgg aat ttt tea act tta tgt gtt cgc aga att ttt age ctt 1492 

Ala Lys Trp Asn Phe Ser Thr Leu Cys Val Arg Arg He Phe Ser Leu 
395 400 405 410 
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ata gga acg act gga cct gca gta ttc ccg gta get get ggc ttc att 154c 
He Gly Met He Gly Pro Ala Val Phe Leu Val Ala Ala Gly Phe He 
415 420 425 

ggc tgt gat tat tct ttg gcc gtt get ttc eta act ata tea aca aca 1588 
Gly Cys Asp Tyr Ser Leu Ala Val Ala Phe Leu Thr He Ser Thr Thr 
430 435 440 

ctg gga ggc ttt tgc tct tct gga ttt age ate aac cat ctg gat att 1636 
Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Asp lie 
445 450 455 

get cct teg tat get ggt ate etc ctg ggc ate aca aat aca ttt gee 1684 
Ala Pro Ser Tyr Ala Gly He Leu Leu Gly He Thr Asn Thr Phe Ala 
460 465 470 

act att eca gga atg gtt ggg cec gtc att get aaa agt ctg acc cct 1732 
Thr He Pro Gly Met Val Gly Pro Val He Ala Lys Ser Leu Thr Pro 
475 480 485 490 

gat aac act gtt gga gaa tgg caa acc gtg ttc tat att get get get 1780 
Asp Asn Thr Val Gly Glu Trp Gin Thr Val Phe Tyr He Ala Ala Ala 
495 ' 500 505 



att aat gtt ttt ggt gcc att tte ttt aca eta ttc gee aaa ggt gaa 1828 
He Asn Val Phe Gly Ala He Phe Phe Thr Leu Phe Ala Lys Gly Glu 
510 515 520 

gta caa aac tgg get etc aat gat cac eat gga cac aga eac 1870 
Val Gin Asn Trp Ala Leu Asn Asp His His Gly His Arg His 
525 530 535 

tgaaggaace aataaataat cetgectcta ttaatgtatt tttatttate atgtaacctc 1930 

aaagtgcctt ctgtattgtg taagcattet atgtettttt ttaattgtac ttgtattaga 1990 



tttttaaggc ctataatcat gaaatatcae 
ttaattatga ataatatgta agetaggaet 
gtcgggcaae atgaagtagg acagttetgt 
agctgaaaca gacctectga taectttget 



tagttgccag aataataaaa tgaactgtgt 2050 
tctaetttag gtteacatac etgcctgcta 2110 
tgatttttta gggecatact aaagggaatg 2170 
taattaaaet agatgataat tetcaggtac 2230 



tgataaacae etgttgttgt tcaetttcet eataaaaatt gtcagctcte tctgacaett 2290 
agacetcaaa ctrtagcatc tctgtggage tgccatccac tgtataattt cgectggcaa 2350 

16 
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ctggactgag gggagtgtgc ccaggcagct gccaagcact ccctccctgg cttcagggcc 2410 
agagtgccca gcgtttatca gaggcagcat ccaagcccag agccagtgtc gactctccgg 2470 
ctggtgcctc tcctctgagg ggctatcaat gtgtagataa agccctgagt aggcaagagc 2530 
agtgagatcc actgctatgg tcttgacaca tcctcaaact ttcccttccc agcacagagg 2590 
aatattggct ggcacgcaac ctgcaaaaga aaaatgcgaa gcggccgggc acggtggctc 2650 
atgcctgtaa tcccagcact ttggggggct gaggtgggcg aatcatgaga tcaggagttc 2710 
gagaccagcc tggccagcat ggtgaaaccc catctctact aaaaatacaa aaaattagct 2770 
gggcgtggtg acgggcgcct gtaatcccag atactcagga ggctgaggta ggagaatcac 2830 
ttgaacctgg gaggtggaag ttgcagtgaa ccaagatcac gccactgcac tccagcctgg 2 890 
gcgacggagc gagactccaa ctcaaaaaaa aaaaaaaaaa 2930 



<210> 8 
<211> 536 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Ala Ala Gly Ala Met Thr Pro Pro Arg Pro Val Gin Pro Ala Arg 
15 10 15 

Pro Gly Gly Phe Gly Leu Ser Gly Arg Arg Ser Leu lieu Cys Gin Val 
20 25 30 

Ala Ser Thr Pro Ala His Val Gly Val Met Arg Ser Pro Val Arg Asp 
35 40 45 

Lieu Ala Arg Asn Asp Gly Glu Glu Ser Thr Asp Arg Thr Pro Leu Leu 
50 55 60 

Pro Gly Ala Pro Arg Ala Glu Ala Ala Pro Val Cys Cys Ser Ala Arg 
65 70 75 80 

Tyr Asn Leu Ala lie l-eu Ala Phe Phe Gly Phe Phe lie Val Tyr Ala 
85 90 95 

Leu Arg Val Asn Leu Ser Val Ala Leu Val Asp Met Val Asp Ser Asn 
100 105 110 

17 
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Thr Thr Leu Glu Asp Asn Arg Thr Ser Lys Ala Cys Pro Glu His Ser 
115 120 125 

Ala Pro lie Lys Val Lis His Asn Gin Thr Gly Lys Lys Tyr Gin Trp 
130 135 140 

Asp Ala Glu Thr Gin Gly Trp lie Leu Gly Ser Phe Phe Tyr Gly Tyr 
145 150 155 160 

lie lie Thr Gin lie Pro Gly Gly Tyr Val Ala Ser Lys lie Gly Gly 
165 170 175 

Lys Met Leu Leu Gly Phe Gly lie Leu Gly Thr Ala Val Leu Thr Leu 
180 185 190 

Phe Thr Pro lie Ala Ala Asp Leu Gly Val Gly Pro Leu He Val Leu 
195 200 205 

Arg Ala Leu Glu Gly Leu Gly Glu Gly Val Thr Phe Pro Ala Met His 
210 215 220 

Ala Met Trp Ser Ser Trp Ala Pro Pro Leu Glu Arg Ser Lys Leu Leu 
225 230 235 240 

Ser He Ser Tyr Ala Gly Ala Gin Leu Gly Thr Val He Ser Leu Pro 
245 250 255 

Leu Ser Gly He He Cys Tyr Tyr Met Asn Trp Thr Tyr Val Phe Tyr 
260 265 270 

Phe Phe Gly Thr He Gly He Phe Trp Phe Leu Leu Trp He Trp I^eu 
275 280 285 

Val Ser Asp Thr Pro Gin Lys His Lys Arg He Ser His Tyr Glu Lys 
290 295 300 

Glu Tyr He Leu Ser Ser Leu Arg Asn Gin Leu Ser Ser Gin Lys Ser 
305 310 315 320 

Val Pro Trp Val Pro He Leu Lys Ser Leu Pro Leu Trp Ala He Val 
325 330 335 

Val Ala His Phe Ser Tyr Asn Trp Thr Phe Tyr Thr Leu Leu Thr Leu 
340 345 350 

Leu Pro Thr Tyr Met Lys Glu He Leu Arg Phe Asn Val Gin Glu Asn 
355 360 365 

18 



BNSDOCID: <WO_0005375A1_IA> 



wo 00/05375 



PCT/US99/16676 



Gly Phe Leu Ser Ser Leu Pro Tyr Leu Giy Ser Trp Leu Cys Met lie 
370 375 380 

Leu Ser Gly Gin Ala Ala Asp. Asn Leu Arg Ala Lys Trp Asn Phe Ser 
385 390 395 400 

Thr Leu Cys Val Arg Arg lie Phe Ser Leu lie Gly Met lie Gly Pro 
405 410 415 

Ala Val Phe Leu Val Ala Ala Gly Phe lie Gly Cys Asp Tyr Ser Leu 
420 425 430 

Ala Val Ala Phe Leu Thr lie Ser Thr Thr Leu Gly Gly Phe Cys Ser 
435 440 445 

Ser Gly Phe Ser lie Asn His Leu Asp lie Ala Pro Ser Tyr Ala Gly 
450 455 460 

lie Leu Leu Gly lie Thr Asn Thr Phe Ala Thr He Pro Gly Met Val 
465 470 475 480 

Gly Pro Val He Ala Lys Ser Leu Thr Pro Asp Asn Thr Val Gly Glu 
485 490 495 

Trp Gin Thr Val Phe Tyr He Ala Ala Ala He Asn Val Phe Gly Ala 
500 505 510 

He Phe Phe Thr Leu Phe Ala Lys Gly Glu Val Gin Asn Trp Ala Leu 
515 520 525 

Asn Asp His His Gly His Arg His 
530 535 



<210> 9 
<211> 1485 
<212> ONA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human/sheep 
consensus sequence 

<220> 

<221> CDS 

<222> (1) . . (1485) 
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<400> 9 

atg arg ccy ccg gtt ysr gac ytr gcc csg arc gay ggc gag gag rgc 48 

Met Xaa Xaa Pro Val Xaa Asp Xaa Ala Xaa Xaa Xaa Gly Glu Glu Xaa 
1 _ 5 10 15 

wcg gac cgc acr cck ccy ctr cmg sgc gcc ccr egg gcs gaa sec get 96 
Xaa Asp Arg Xaa Xaa Xaa Xaa Xaa Xaa Ala Xaa Arg Xaa Glu Xaa Ala 
20 25 30 

cca gtr tge tgc tct get cgt tac aac yta gca wtt ttg kcc ttt ttt 144 
Pro Xaa Cys Cys Ser Ala Arg Tyr Asn Xaa Ala Xaa Leu Xaa Phe Phe 
35 40 45 

ggt ttc ttc rtt sts tat kca tta egk gtg aat ctg agy gtt ger yta 192 
Gly Phe Phe Xaa Xaa Tyr Xaa Leu Xaa Val Asn Leu Xaa Val Xaa Xaa 
50 55 60 

gtg gay atg gtr gat tea aay aca act kym raa gat aat aga ack tec 240 
Val Xaa Met Xaa Asp Ser Xaa Thr Thr Xaa Xaa Asp Asn Arg Xaa Ser 
65 70 75 80 

was gmg tgt sea gag cat tet get ccc ata aaa gtt cwt cay aay caa 286 
Xaa Xaa Cys Xaa Glu His Ser Ala Pro lie Lys Val Xaa Xaa Xaa Gin 
85 90 95 

aeg ggt aar aag tac err tgg gat gca gaa act caa gga tgg att etc 336 
Thr Gly Xaa Lys Tyr Xaa Trp Asp Ala Glu Thr Gin Gly Trp lie Leu 
100 105 110 

ggw tcy ttt tty tat ggc tac ate ate aca car att cct gga gga tat 384 
Xaa Xaa Phe Xaa Tyr Gly Tyr lie lie Thr Xaa lie Pro Gly Gly Tyr 
115 120 125 

gtt gcc age ara akw ggg ggg aar mtg ytg eta gga tty ggg ate ytt 432 
Val Ala Ser Xaa Xaa Gly Gly Xaa Xaa Xaa Leu Gly Xaa Gly lie Xaa 
130 135 140 

gsy acw get rtc ytc ace ctg ttc act ccc mty get gca gat ttm gga 480 
Xaa Xaa Ala Xaa Xaa Thr Leu Phe Thr Pro Xaa Ala Ala Asp Xaa Gly 
145 150 155 160 

gty gga scm cty rtt gya etc agr gca eta gaa ggr eta gga gag ggt 528 
Xaa Gly Xaa Xaa Xaa Xaa Leu Xaa Ala Leu Glu Xaa Leu Gly Glu Gly 
165 170 175 

gty aca twt cca gcc atg eat gcc atg tgg tct tew tgg get ccc cct 576 
xaa Thr Xaa Pro Ala Met His Ala Met Trp Ser Xaa Trp Ala Pro Pro 
180 185 190 
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ctt gaa aga age aar ctt ctk agy act ccr cac gca gga gca car ctt 624 

Leu Glu Arg Ser Xaa Leu Xaa Xaa lie Xaa Tyr Ala Gly Ala Xaa Leu 
195 200 205 

ggg aca gta rtt tct ctt cct ctt tec gga rta act tgc tac tat atg 672 

Gly Thr Val Xaa Ser Leu Pro Leu Ser Gly Xaa lie Cys Tyr Tyr Met 
210 215 220 

aat tgg act tat gtc ttc tay tty ttt ggy ayt rtt gga acm wty tgg 720 

Asn Trp Thr Tyr Val Phe Xaa Xaa Phe Xaa Xaa Xaa Gly Xaa Xaa Trp 

225 230 235 240 



ttt mtt ttr tgg ate tgs tta gtt agt gay aca cca saa amw cac aag 768 

Phe Xaa Xaa Trp lie Xaa Leu Val Ser Xaa Thr Pro Xaa Xaa His Lys 

245 2S0 255 

asa aty wcy cmy tat gaa aag gar tay att ctt tea tea tta ara aat 816 

Xaa Xaa Xaa Xaa Tyr Glu Lys Xaa Xaa lie Leu Ser Ser Leu Xaa Asn 

260 265 270 



cag cty tct tea cag aag tea gtg ccg tgg rta ccy atk ytr aaa tern 864 

Gin Xaa Ser Ser Gin Lys Ser Val Pro Trp Xaa Xaa Xaa Xaa Lys Xaa 
275 '280 285 

ctg cca ctt tgg get aty gtm gtt gca cay ttt tct tac aac tgg act 912 

Leu Pro Leu Trp Ala Xaa Xaa Val Ala Xaa Phe Ser Tyr Asn Trp Thr 
290 295 300 

ttt tat act ttr ttg acm tta ttg cct act tay atg aag gar rtc eta 960 

Phe Tyr Thr Xaa Leu Xaa I*eu Leu Pro Thr Xaa Met Lys Xaa Xaa Leu 
305 310 315 320 



agg ttc aat rtt caa gag aat ggg ttt tta tct kca kts cct tat tta 1008 
Arg Phe Asn Xaa Gin Glu Asn Gly Phe Leu Ser Xaa Xaa Pro Tyr Leu 
325 330 335 

ggy tst tgg tta tgt atg ate ctg tck ggt caa get get gac aat tta 1056 
Xaa Xaa Trp Leu Cys Met lie Leu Xaa Gly Gin Ala Ala Asp Asn Leu 
340 345 350 



agg gca ara tgg aat ttt tea act ytr tgk gtt egm aga rtt ttt age 1104 
Arg Ala Xaa Trp Asn Phe Ser Thr Xaa Xaa Val Xaa Arg Xaa Phe Ser 
355 360 365 

Ctt ata ggr atg att gga cct ger rta ttc ctg gtw gey gew ggm tty 1152 
Leu lie Xaa Met He Gly Pro Xaa Xaa Phe Leu Xaa Xaa Xaa Xaa Xaa 
370 375 380 
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acw ggc tgt gat tat tcy ttg gey gtc 
Xaa Gly Cys Asp Tyr Xaa Leu Xaa Val 
385 390 

acm ctg gga ggc ttt tgc tct tct gga 
Xaa Leu Gly Gly Phe Cys Ser Ser Gly 
405 

att get cct teg tat get ggt aty cte 
He Ala Pro Ser Tyr Ala Gly Xaa Leu 
420 425 

gee aet att cew gga atg rtt ggg cee 
Ala Thr He Xaa Gly Met Xaa Gly Pro 
435 440 

eet gak aae aet rtt gga gaa tgg caa 
Pro Xaa Asn Thr Xaa Gly Glu Trp Gin 
450 455 

get aty aat gtw ttt ggt gcc att ttc 
Ala Xaa Asn Xaa Phe Gly Ala He Phe 
465 470 

gaa gtr caa aac tgg gey mtc art gat 
Glu Xaa Gin Asn Trp Xaa Xaa Xaa Asp 
485 



gcw ttc eta acy ata tea aea 1200 
Xaa Phe Leu Xaa He Ser Thr 
395 400 

ttt age ate aae cat ctg gay 1248 
Phe Ser He Asn His Leu Xaa 
410 415 

ctg ggc ate aea aat acm ttt 1296 
Leu Gly He Thr Asn Xaa Phe 
430 

rtc att gey ara agt etk ace 1344 
Xaa He Xaa Xaa Ser Xaa Thr 
445 

aey gtk ttc try aty get get 1392 
Xaa Xaa Phe Xaa Xaa Ala Ala 
460 

tty aca eta ttc gcc aaa ggt 1440 
Xaa Thr Leu Phe Ala Lys Gly 
475 480 

cac caw gga cac aga mac 1485 
His Xaa Gly His Arg Xaa 
490 495 



<210> 10 
<211> 495 
<212> PRT 

<213> Artificial Sequence 
<400> 10 

Met Xaa Xaa Pro Val Xaa Asp Xaa Ala Xaa Xaa Xaa Gly Glu Glu Xaa 
1 5 10 IS 

Xaa Asp Arg Xaa Xaa Xaa Xaa Xaa Xaa Ala Xaa Arg Xaa Glu Xaa Ala 
20 25 30 

Pro xaa Cys Cys Ser Ala Arg Tyr Asn Xaa Ala Xaa Leu Xaa Phe Phe 
35 40 45 

Gly Phe Phe Xaa Xaa Tyr Xaa Leu Xaa Val Asn Leu Xaa Val Xaa Xaa 
SO 55 60 
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Val Xaa Met Xaa Asp Ser Xaa Thr Thr Xaa Xaa Asp Asn Arg Xaa Ser 
65 70 75 80 

Xaa Xaa Cys Xaa Glu His Ser Ala Pro lie Lys Val Xaa Xaa Xaa Gin 
85 90 95 

Thr Gly Xaa Lys Tyr Xaa Trp Asp Ala Glu Thr Gin Gly Trp lie Leu 
100 105 110 

Xaa Xaa Phe Xaa Tyr Gly Tyr lie He Thr Xaa He Pro Gly Gly Tyr 
115 120 125 

Val Ala Ser Xaa Xaa Gly Gly Xaa Xaa Xaa Leu Gly Xaa Gly He Xaa 
130 135 140 

Xaa Xaa Ala Xaa Xaa Thr Leu Phe Thr Pro Xaa Ala Ala Asp Xaa Gly 
145 150 155 160 

Xaa Gly Xaa Xaa Xaa Xaa Leu Xaa Ala Leu Glu Xaa Leu Gly Glu Gly 
165 170 175 

Xaa Thr Xaa Pro Ala Met His Ala Met Trp Ser Xaa Trp Ala Pro Pro 
180 185 190 

Leu Glu Arg Ser Xaa Leu Xaa Xaa He Xaa Tyr Ala Gly Ala Xaa Leu 
195 200 205 

Gly Thr Val Xaa Ser Leu Pro Leu Ser Gly Xaa He Cys Tyr Tyr Met 
210 2X5 220 

Asn Trp Thr Tyr Val Phe Xaa Xaa Phe Xaa Xaa Xaa Gly Xaa Xaa Trp 
225 230 235 240 

Phe Xaa Xaa Trp He Xaa Leu Val Ser Xaa Thr Pro Xaa Xaa His Lys 
245 250 255 

Xaa Xaa Xaa Xaa Tyr Glu Lys Xaa Xaa He Leu Ser Ser Leu Xaa Asn 
260 265 270 

Gin Xaa Ser Ser Gin Lys Ser Val Pro Trp Xaa Xaa Xaa Xaa Lys Xaa 
275 280 285 

Leu Pro Leu Trp Ala Xaa Xaa Val Ala Xaa Phe Ser Tyr Asn Trp Thr 
290 295 300 

Phe Tyr Thr Xaa Leu Xaa Leu Leu Pro Thr Xaa Met Lys Xaa Xaa Leu 
305 310 315 320 
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Arg Phe Asr. Xaa Gin Glu Asn Gly Phe Leu Ser Xaa Xaa Pro Tyr Leu 
325 330 335 

Xaa Xaa Trp Leu Cys Met lie Leu Xaa Gly Gin Ala Ala Asp Asn Leu 
340 345 350 

Arg Ala Xaa Trp Asn Phe Ser Thr Xaa Xaa Val Xaa Arg Xaa Phe Ser 
355 360 365 

Leu He Xaa Met He Gly Pro Xaa Xaa Phe Leu Xaa Xaa Xaa Xaa Xaa 
370 375 380 

Xaa Gly Cys Asp Tyr Xaa Leu Xaa Val Xaa Phe Leu Xaa He Ser Thr 
385 390 395 400 

Xaa Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Xaa 
405 410 415 

He Ala Pro Ser Tyr Ala Gly Xaa Leu Leu Gly He Thr Asn Xaa Phe 
420 425 430 

Ala Thr He Xaa Gly Met Xaa Gly Pro Xaa He Xaa Xaa Ser Xaa Thr 
435 440 445 

Pro Xaa Asn Thr Xaa Gly Glu Trp Gin Xaa Xaa Phe Xaa Xaa Ala Ala 
450 455 460 

Ala Xaa Asn Xaa Phe Gly Ala He Phe Xaa Thr Leu Phe Ala Lys Gly 
465 470 475 480 

Glu Xaa Gin Asn Trp Xaa Xaa Xaa Asp His Xaa Gly His Arg Xaa 
485 490 495 



<210> 11 
<211> 1485 
<212> ONA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human/sheep 
consencus sequence 

<220> 

<221> CDS 

<222> (1) . . (1485) 

<400> 11 
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atg ang ten ccg gtc nnn gac nzn gcc cng anc gan ggc gag gag ngc 48 
Met Xaa Xaa Pro Val Xaa Asp Xaa Ala Xaa Xaa Xaa Gly Glu Giu Xaa 
15 10 15 

ncg gac cgc acn ccn ctn ctn cng ngc gcc can egg gen gaa ncc get 96 
Xaa Asp Arg Xaa Xaa Xaa Xaa Xaa Xaa Ala Xaa Arg Xaa Glu Xaa Ala 
20 25 30 

oca gtn tgc tgc cct get egt tac aac nta gea ntt ttg nee ttt ttt 144 
Pro Xaa Cys Cys Ser Ala Arg Tyr Asn Xaa Ala Xaa Leu Xaa Phe Phe 
35 40 45 

ggt ttc ttc ntt ntn tat nca tta egn gtg aat ctg agn gtt gen nta 192 
Gly Phe Phe Xaa Xaa Tyr Xaa Leu Xaa Val Asn Leu Xaa Val Xaa Xaa 
SO 55 60 

gtg gan atg gtn gat tea aan aca act mm naa gat aat aga acn tee 240 
Val Xaa Met Xaa Asp Ser Xaa Thr Thr Xaa Xaa Asp Asn Arg Xaa Ser 
65 70 75 80 

nan gng tgt nea gag eat tet'get cec ata aaa gtt cnt can aan caa 288 
xaa Xaa Cys Xaa Glu His Ser Ala Pro lie Lys Val Xaa Xaa Xaa Gin 
85 90 95 

aeg ggt aan aag tac cnn tgg gat gca gaa act caa gga tgg att etc 336 
Thr Gly Xaa Lys Tyr Xaa Trp Asp Ala Glu Thr Gin Gly Trp lie Leu 
100 105 110 

ggn ten ttt ttn tat ggc tac ate ate aca can att cct gga gga tat 384 
Xaa Xaa Phe Xaa Tyr Gly Tyr He He Thr Xaa He Pro Gly Gly Tyr 
115 120 125 

gtt gcc age ana ann ggg ggg aan ntg ntg eta gga ttn ggg ate ntt 432 
Val Ala Ser Xaa Xaa Gly Gly Xaa Xaa Xaa Leu Gly Xaa Gly He Xaa 
130 135 140 

gnn acn get ntc ntc acc ctg ttc act cec ntn get gca gat ttn gga 480 
Xaa Xaa Ala Xaa Xaa Thr Leu Phe Thr Pro Xaa Ala Ala Asp Xaa Gly 
145 ISO 155 160 

gtn gga ncn ctn ntt gna etc agn gca eta gaa ggn eta gga gag ggt 528 
Xaa Gly Xaa Xaa Xaa Xaa Leu Xaa Ala Leu Glu Xaa Leu Gly Glu Gly 
165 170 175 

gtn aca tnt cca gcc atg cat gcc atg tgg tct ten tgg get cec cct 576 
Xaa Thr Xaa Pro Ala Met His Ala Met Trp Ser Xaa Trp Ala Pro Pro 
180 185 190 
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ctt gaa aga age aan etc ccn agn act ten tat gea gga gca can ctt 624 
Leu Glu Arg Ser Xaa Leu Xaa Xaa lie Xaa Tyr Ala Gly Ala Xaa Leu 
195 200 205 

ggg aca gta ntt tct ctt cct ctt tec gga nta att tgc tac tat atg 672 
Gly Thr Val Xaa Ser Leu Pro Leu Ser Gly Xaa lie Cys Tyr Tyr Met 
210 215 220 

aat tgg act tat gtc ttc tan ttn ttt ggn ant ntt gga atn ntn tgg 720 
Asn Trp Thr Tyr Val Phe Xaa Xaa Phe Xaa Xaa Xaa Gly Xaa Xaa Trp 
225 230 235 240 

ttt ntt ttn tgg ate tgn tta gtt agt gan aca cea naa ann cac aag 768 
Phe Xaa Xaa Trp lie Xaa Leu Val Ser Xaa Thr Pro Xaa Xaa His Lys 
245 250 255 

ana atn ncn cnn tat gaa aag gan tan att ctt tea tea tta ana aat 816 
Xaa Xaa Xaa Xaa Tyr Glu Lys Xaa Xaa lie Leu Ser Ser Leu Xaa Asn 
260 265 270 

cag ctn tct tea cag aag tea gtg ccg tgg nta ccn atn ntn aaa ten 864 
Gin Xaa Ser Ser Gin Lys Ser Val Pro Trp Xaa Xaa Xaa Xaa Lys Xaa 
275 280 285 

ctg cca ctt tgg get atn gtn gtt gca can ttt tct tac aac tgg act 912 
Leu Pro Leu Trp Ala Xaa Xaa Val Ala Xaa Phe Ser Tyr Asn Trp Thr 
290 295 300 

ttt tat act ttn ttg acn tta ttg cct act tan atg aag gan ntc eta 960 
Phe Tyr Thr Xaa Leu Xaa Leu Leu Pro Thr Xaa Met Lys Xaa Xaa Leu 
305 310 315 320 

agg ttc aat ntt caa gag aat ggg ttt tta tct nca ntn cct tat tta 1008 
Arg Phe Asn Xaa Gin Glu Asn Gly Phe Leu Ser Xaa Xaa Pro Tyx Leu 
325 330 335 

ggn tnt tgg tta tgt atg ate ctg ten ggt caa get get gac aat tta 1056 
Xaa Xaa Trp Leu Cys Met He Leu Xaa Gly Gin Ala Ala Asp Asn X«eu 
340 345 350 

agg gea ana tgg aat ttt tea act ntn tgn gtt cgn aga ntt ttt age 1104 
Arg Ala Xaa Trp Asn Phe Ser Thr Xaa Xaa Val Xaa Arg Xaa Phe Ser 
355 360 365 

ctt ata ggn atg att gga ect gen nta ttc ctg gtn gen gen ggn ttn 1152 
Leu He Xaa Net He Gly Pro Xaa Xaa Phe Leu Xaa Xaa Xaa Xaa Xaa 
370 375 380 
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azn ggc zgz gac tat ten teg gen gtt 
Xaa Gly Cys Asp Tyr Xaa Leu Xaa Val 
385 390 

acn ctg gga ggc ttt tgc cct. tct gga 
Xaa Leu Gly Gly Phe Cys Ser Ser Gly 
405 

att get cet teg tat get ggt atn etc 
He Ala Pro Ser Tyr Ala Gly Xaa Leu 
420 425 

gee, act att ecn gga atg ntt ggg ccc 
Ala Thr He Xaa Gly Met Xaa Gly Pro 
435 440 

ect gan aac act ntt gga gaa tgg caa 
Pro Xaa Asn Thr Xaa Gly Glu Trp Gin 
450 455 

get atn aat gtn ttt ggt gcc att ttc 
Ala Xaa Asn Xaa Phe Gly Ala He Phe 
465 470 

gaa gtn caa aac tgg gen ntc ant gat 
Glu Xaa Gin Asn Trp Xaa Xaa Xaa Asp 
485 



gen ttc eta acn ata tea aca 1200 
Xaa Phe Leu Xaa He Ser Thr 
395 400 

ttt age ate aac cat ctg gan 1248 
Phe Ser He Asn His Leu Xaa 
410 415 

ctg ggc ate aca aat acn ttt 1296 
Leu Gly He Thr Asn Xaa Phe 
430 

ntc att gen ana agt ctn ace 1344 
Xaa He Xaa Xaa Ser Xaa Thr 
445 

acn gtn ttc tnn atn get get 1392 
Xaa Xaa Phe Xaa Xaa Ala Ala 
460 

ttn aca eta ttc gee aaa ggt 1440 
Xaa Thr Leu Phe Ala Lys Gly 
475 480 

cac can gga cac aga nac 1485 
His Xaa Gly His Arg Xaa 
490 495 



<210> 12 
<211> 495 
<212> PRT 

<213> Artificial Sequence 
<400> 12 

Met Xaa Xaa Pro Val Xaa Asp Xaa Ala Xaa Xaa Xaa Gly Glu Glu Xaa 
15 10 15 

Xaa Asp Arg Xaa Xaa Xaa Xaa Xaa Xaa Ala Xaa Arg Xaa Glu Xaa Ala 
20 25 30 

Pro Xaa Cys Cys Ser Ala Arg Tyr Asn Xaa Ala Xaa Leu Xaa Phe Phe 
35 40 45 

Gly Phe Phe Xaa Xaa Tyr Xaa Leu Xaa Val Asn Leu Xaa Val Xaa Xaa 
SO 55 60 

Val Xaa Met Xaa Asp Ser Xaa Thr Thr Xaa Xaa Asp Asn Arg Xaa Ser 
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65 70 75 



80 



Xaa Xaa Cys Xaa Glu His Ser Ala Pro He Lys Val Xaa Xaa Xaa Gin 
85 90 95 

Thr Gly Xaa Lys Tyr Xaa Trp Asp Ala Glu Thr Gin Gly Trp He Leu 
100 105 110 

Xaa Xaa Phe Xaa Tyr Gly Tyr He He Thr Xaa He Pro Gly Gly Tyr 
115 120 125 

Val Ala Ser Xaa Xaa Gly Gly Xaa Xaa Xaa Leu Gly Xaa Gly He Xaa 
130 135 140 

Xaa Xaa Ala Xaa Xaa Thr Leu Phe Thr Pro Xaa Ala Ala Asp Xaa Gly 
145 150 155 160 

Xaa Gly Xaa Xaa Xaa Xaa Leu Xaa Ala Leu Glu Xaa Leu Gly Glu Gly 
165 170 175 

Xaa Thr Xaa Pro Ala Met His Ala Met Trp Ser Xaa Trp Ala Pro Pro 
180 185 190 

Leu Glu Arg Ser Xaa Leu Xaa Xaa He Xaa Tyr Ala Gly Ala Xaa Leu 
155 200 205 

Gly Thr Val Xaa Ser Leu Pro Leu Ser Gly Xaa He Cys Tyr Tyr Met 
210 215 220 

Asn Trp Thr Tyr Val Phe Xaa Xaa Phe Xaa Xaa Xaa Gly Xaa Xaa Trp 
225 230 235 240 

Phe Xaa Xaa Trp He Xaa Leu Val Ser Xaa Thr Pro Xaa Xaa His Lys 
245 250 255 

Xaa Xaa Xaa Xaa Tyr Glu Lys Xaa Xaa He Leu Ser Ser Leu Xaa Asn 
260 265 270 

Gin Xaa Ser Ser Gin Lys Ser Val Pro Trp Xaa Xaa Xaa Xaa Lys Xaa 
275 280 285 

hen Pro Leu Trp Ala Xaa Xaa Val Ala Xaa Phe Ser Tyr Asn Trp Thr 
290 295 300 

Phe Tyr Thr Xaa Leu Xaa Leu Leu Pro Thr Xaa Met Lys Xaa Xaa Leu 
305 310 315 320 

Arg Phe Asn Xaa Gin Glu Asn Gly Phe Leu Ser Xaa Xaa Pro Tyr Leu 
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325 



330 



335 



Xaa Xaa Trp Leu Cys Met lie Leu Xaa Gly Gin Ala Ala Asp Asn Leu 
340 345 350 



Arg Ala Xaa Trp Asn Phe Ser Thr Xaa Xaa Val Xaa Arg Xaa Phe Ser 
355 360 365 

Leu lie Xaa Met lie Gly Pro Xaa Xaa Phe Leu Xaa Xaa Xaa Xaa Xaa 
370 375 380 

Xaa Gly Cys Asp Tyr Xaa Leu Xaa Val Xaa Phe Leu Xaa He Ser Thr 
385 390 395 400 

Xaa Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser He Asn His Leu Xaa 
405 410 415 



He Ala Pro Ser Tyr Ala Gly Xaa Leu Leu Gly He Thr Asn Xaa Phe 
420 425 430 

Ala Thr He Xaa Gly Met Xaa Gly Pro Xaa He Xaa Xaa Ser Xaa Thr 
435 440 445 

Pro Xaa Asn Thr Xaa Gly Glu Trp Gin Xaa Xaa Phe Xaa Xaa Ala Ala 
450 455 460 



Ala Xaa Asn Xaa Phe Gly Ala He Phe Xaa Thr Leu Phe Ala Lys Gly 
465 470 475 480 

Glu Xaa Gin Asn Trp Xaa Xaa Xaa Asp His Xaa Gly His Arg Xaa 
485 490 495 
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Fii, Changlin 

<l?D> QBE Toxin ReceptOi? 

<L3D:i CARfi-OO^/OlWO 
<14a> 

<151> lSSfl-07-23 

<no> Patentln Ver. 2.0 

<210> 1 
<2ll> 2602 
<212> WA 

<213> Homo Bap tana 
<221> QDQ 

<222> (5e) . . (1M2} 
<400> L 

togggccggc gctcccttct ctgccaggcg gc^A^acac ctgctcacgt ^ffgcgtc 57 

Atj agg tet ccg gtt cga gac ctg gcc egg ciac gfat ggc gag gag lOS 
Httt Arg ^vr Pro Val Arg Aap Lbu Ala Arg A^n Asp Glly Glu Glu £«r 
IS 10 i5 

gac cge acj cct ctt eta ccg ggc gcc cgb cg^ gcc g4a gcc get isa 
Thr Asp Arsr Thr Pro Ij«u Lsu Pxd 61y Ala Fro Arg Ala Glu Ala Ala 

2C 25 

cca gtg tgc tgc tct get cgt tac aac tta gaa att ttg gcc ttt ttt 301 
Pro Val Cys Cya Ser Ala Arg 7yr Asn Leu Ala Ila Lau Ala Pha Phe 
35 40 45 

ggt ttc ttc att gtg tsL gca tta cgt gtg aat ctg agt gtt gcg tta 
□ly Phe Phe He Val Tyr Ala Le\x Arg VaI Asn Leu Ser Val Ala Leu 
50 S5 60 

gtg gat atg gta gat tea aat aca acb tta, gaA gat aat aga aet tec 297 

1 



WU 01>/U537S 



VaL Asp Met Vai Asp Ser Aan Thr Thr U^u Glu Asp jkba Azg Thr Ser 

70 75 

aag gcg tgt cca gag cat tCC get ccc ata aaa gtt cat cat aat cftA 

Lyfc M« Cyi^ Pro Glia His Ser Ala Pro t/A VaI his His Afiil Cln 
as 90 55 



345 



acg ggt aag aag tac caa tgg gat gCA gsa act caa g^A egg att etc 333 
Thr <3Jly Ly5 X.y3 T^'r Girt Trp Asp Ala Glu Thr Gift 01 y Txp lie Leu 
iOO 1D5 

ggt tec tet Ctt tat ggc t«c Ate 4tc aca cag att cct gga gga tat 441 
Qly fi#r Phc Phe Oy^^ Gly Tyr lla lie Thr Gin ll* Pzo Oly Qly tyr 
lis 120 125 

gtt 9CC age ma ata ggg ggg 4m «tg ctg eta gga ttt ggg ate ctt 49^ 
VaI Ala dar Lya lit Sly €ly Lye Met Leu Leu Qly Phe Gly Ho Leia 
130 135 140 

ggc act get gtc etc acc ctgr ttc act ccc ate get gca gat tta gga 537 
Gly Xhr AIA vm Leu Ibr Lau Phe Thr I>ro He Ala Ala Asp Leu <]ly 
14S 150 155 160 

gtt gga cca etc att gta etc aga gca Cta gaa gga eta gga ga^ ggt 59 & 
VBl Gly Pro Lfiu He Val Ltu Arg Ala Lau Qlu Gly Leu <3ly «u GXy 
Xff* 170 17S 

gtt ac« ttt cca gee atg cat gcc atg tgg tct tct tgg get ccc cct €31 
Val Thr Phe Pro Ala Met His Ala Met Trp fier Ser ^rp Ala Pro Fro 
ifio 1B5 idd 

ctt g»a a^a age aaa ctt ctt a^CF 4tt teg cat gca gga gcft eag ctt 601 
Leu Glu Arg dar Lys htu LftU Sar He dex Tyr Ala qly Ala Oln Leu 
1»S 200 205 

ggg aca gta att tct ctt oct ctt tct gga at a att tgc taa tat atg 739 
Gly Thr VftI He ser Leu Pro Leu ear €ly tie lie cya lyr Tyr Met 

210 215 220 

aat tgg act tat gtc tto tac ttt ttt ggt aet att gga at a ttt tgg 777 
Asn Trp Tftr Tyr Val Phe Tyr Ph« Phe flly Thr Jle Gly xle Phe Trp 
225 230 23S 2«D 

ttt Ctt ttg tgg ate tgg tta gtt agt gac aca cca caa aaa cae aag $25 
Pha 2^u Lbu Trp 11 a Trp Leu Val Ser Aap Thr Pro Qln Lyj Hid Lya 
345 250 2SS 

aga att tec cat tut gaa «ag gaa tac att ett tea tea tta aga aat 373 

2 
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Arg lie Ser His Tyr Ciw l-ys CI^u Tyr rle L»u Str Ser i^cu ;u-g Abt^ 
260 365 aio 

cag ctt tct tea cag j^^cf cci» gtg ccg tgg gta ccc att ttA aaa tec 321 
Cln Leu Ser Ser Oin Lya Ser VaL Pro Trp VaI Pro He Leu Lys 9«r 

2HD 26s 

ctg qca Ctt tgg get <itc cjta get gca cac ttt tct tsc aac tgp act 969 
Ley Pro Leu Irp Ala tie Val Val Ala Hlb Phe Ser T/r Aan Txp Thi- 
230 295 30O 

ttc tat act tta ttg aca tta ttg cct act Cat atg aag ate eta lon 
Pha Tyr Thr Leu Leu Thr L«u pafO Thr Vyr Met Lya Glu lie l^u 

305 310 aiS 320 

agg ttc aat gtt caa gag aat ggg ttt tta tCt tCA ttg cct tat tta I0$£ 
Arg Phe Aan Val Gin Oiu A»n C?ly Phe Leu 6er Ser Leu Pro Tyr L«U 
32& 130 

ggc tct tgg tta cgt atg ate ctg tct ggt caa get get guc aat tta aii3 
Gly Ser Trp Leu Cya Met ll^ Leu Ser Gly gin Alft Ala Asp Asn Leu 
34D 34$ 3S0 

9^gg gca aa«i tc^ «a& ttt ten net tta tgt gtt ogc aga att ttt mgc 
Arg Ala Lys Trp Aan PhB Ser Thr Lau Cya Val Arg Arg Xle Fha Sax 
ass 360 

ctt ata gga atg att gga cct gca gte ttc ctg cjta crct get ggc ttc 1209 
Leu rle QXy Met 1:1 e Qly Pro Al« V«I Phe Leu V&l Ala Ala 01 y Phe 
370 37S aao 

att ggc tgt gat tat tct ttg gcc gtt get ttc eta act »ta tCA 4Ca 1257 
lie Gly eye Aap Tyr Ser lieu Ala Val Ala Phe Leu Thr IX« &er Tbr 
39S 390 395 AM 

aca ctof gga ggc ttt tgc tct tct gga ttt age ate aac cat ctg gat 1305 
Thr Lbu Gly Cly Phe Cya Ser Ser Gly Phe Ser He Aen Hi a Leu A«p 
405 4XD 415 

att get cct teg tat get ggt ate etc ctg ggc ate aca aat aca ttt 1353 
He jaa Pro s«r Tyr Ala Qly He Leu Leu Gly He Thr Aan Thr Phe 
420 425 430 

gcc act att cca gga atg gtt ggg ccc gte att get aaa agt ctg acc 14DL 
Ala Hhx 21a Pro Gly Mat Val Oly Pro v&l il« Ala Lys Ser Leu Thr 
435 440 445 

cct g»t aac act gtt gga gaa tgg caa acc gtg ttc tat att get get 1445 



2 



Pro iKEp ABJi Thr Val Gly aix, Trp Gin Thr Va] t-^e Tyr He Ala Ala 

455 

get «t aat gtt ttt ggt gcc att ttt ttt aca eta ttc gcc a«« ggt 149, 
Ala tie Asn Val Phe Gly ai* He Phe Phe Thr L-eu Phe AU Lya aiy 

'"C' ^75 

gaa gca caa aac Cgg get etc aat gat cac cat gga Cftc age cac 1542 
Glu 7«1 Oln Aen Trp Ala Leu Afin Abj, Hia His Gljr His Arg Hia 

Cgaagrgwcc aataaataat ectgcctcta tCaaC9CaCt Cttatttatc »CgtaAC4tc 1602 
*aagtgcctt ctgtAttgtg taagcattcc atgtctttet ttaattgtac ttgtatcaga 1662 
tCtttaaggc ctataate*t gaaatatCAc Cagetgccag afttAataaaa tgaaetgtgt 1722 
ctaattatga at^Atatgta agct.ggact tctactttag gtteaeatac ctgectgctA 17B2 
Stegggcuao atgaagcagg «cagtCetgt tgatttttta gggccataee **aggga»tg ib42 
agctgaaaca gaectcetga tacctttgct caaetoaact Ag^tgataae tetcagjtac 1902 
tgataaawc ctgttgttgt tc*ctttcct Cfctaaaaatt gbcagctcte tctg«eactt 1563 
agaeetCMaa ctttagcatc tctgcgga^c tgccat^Mc tgt«t4«ttt cgcctggea« 2023 
etggaccgag gggagtgtgc ccaggcagct gcea*gcact ccetccctgg cttcagggtc 
agagcgccca gegtttatca gaggcAgcat ccMgcccag Agccagtgee gactcttcgg 2142 
ctggtgcctt tcctctgagg ggctatcaat gtgtagata* agccct^agt aggcAagagc 21C2 
agtgagatcc actgceatgg tcttgaeaea tccte»aact tteeettcce ngcacagagg 2262 
aatattggct g?ca(;gcaac etgcaaaaga «4aatgcga» gcggccgggc aoggtgg^tc 4322 
aegectgtaa tcccagwct ttggggggct gaggtgggcg aatcacgaga tcagg»gctc 2382 
gagaeeagec tggccagcat ggtgaaaccc c*tctctaee aaaaacacaa aaaatta^et 2«42 
gggcgtggtg acgggcgcefc gtaaceccag atacteagga ggctgftggta gga^aatcac 2S02 
ttgaftcctgg gaggtggaag ttgcagtgaa ccaagatcftc gocactgcac tccageccgg 2S$2 
gogatggaigc gogactccaa ete«aaaaaa aaaaoaaaaa 2^02 
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<2lO> 2 
e2i2> PRT 

<^l^> KOTTO sapiens 
<40Q> 2 

>let Arg Ser Pro val Are Asp Lei^ Ala Ar^ A5n Asp GLy cLu £5lu Ser 
IS 10 15 

Thr Aap Axg ^hr Pro ieu L«u Pro Gly Ala Pro Arg Ala CJlu Ala Ala 

25 30 

Pro Val Cya <^y9 Ser Ala Ar^ Tyr ksn Leu Ala II* Lau Ala Pha Phe 

Qly Phe Phs Ii« Val Tyr Aia leu Arg Val Aen fi*r Val Ala i^u 

S5 CO 

Val A«p M«t VAl Asp fler Aen Thr Thr Leu Olu .Asp Aan Arg Thr g^r 
65 70 75 ^0 

Lys Ala Cya Pro Glu Kl4 ^er Ala I»rD lie Lya Vul Hia hi a Attn CSln 

90 9S 

Thr Qly hyt Lya Tyr «ln irrp Asp Ala Olu Thr «ln qly Txp lie i^eu 
100 105 lio 

Qly flar Phe Phe Vyv Qly Tyr Urn lis Thx Gin lie Pre Gly Gly Tyr 
lis X20 12S 

val Ale Ser Ly* Ila «y Qly Lya Met i,eu Leu Oly Phe Gly Ho Leu 

135 140 

Qly thr Ala val Leu Thr Leu Phe thr Pro lie Ala A2a Aap Leu Qly 
l« iSO 155 X€0 

V4l Qly Pro 2-Bu lie vial Leu Arg AI* Leu Olu Gly Leu Qly Qlu Gly 
l« 170 175 

val Wir Ph# Pro Ala Mat Hi a Ala H^t Trf> Str a«r Trp Ala Pro Pro 
IBd IQ5 190 

Levx Ql)x Arg S«r Lys Lau Lau Ser He Ser Tyr Ala Qly Ala Gift Lau 

153 aoo aos 

Gly Iter val lie tor Leu Pro Leu 5ar Qly lie lie Cys Tyr Tyr Hat 
210 220 



^^^^"^^ PCT/US99/16676 

Asn Trp Thr Tyr Val Phe Tyr Phe Phe Giy Thr JX^ Qly lie ph€ Trp 

23D 

Phe Leu Trp lit Trp Leu Val £er Aap Thr Pro Gin Lys Kis Lye 

250 

Arg 3l€ Ser Kis Tyr Glu Lys GLu Tyr lie Leu Ser Ser Lau Arg A£n 
2^0 265 2-70 

Gin Leu Ser Ser gin Lye Ser Val Pro Trp Val Pro Leu Lys S#r 
27& 3£0 263 

L»u Pro Leu Trp Ala lio Val Val Aia His ^he Ser Tyt^ Aan Trp Thr 
290 29S 3pg 

Ph* Tyr rhr Lau Leu Tlir Leu Lau Pro Thr Tyr Mat Lya lla Leu 

310 313 

Ar^ Phte Asn V«l Olu Glu Acn C3iy B^he Leu Ser &«r Lau Pro Tyr Lau 
325 33D 335 

Qly ser Trp Leu Cyi5 Net lie Lau Ser Oly <3ln Ala Ala Afip Aan Leu 
340 345 350 

Argp AlA Ly« Trp Asn Phe S^r Thr Lau Cya Val Arg Arg lie Pbe S^zr 

3£0 3«S 

Lau lie Gly Met lie Qly Pro Ala Val Phe Lau Val Ala Aj^ «ly ©he 
370 375 iHO 

He Qly Cy» Asp Tyr a Br Leu AXa Val Ala Phe Leu Thr lie Ser Thr 
3«5 390 

Thr l^u Qly <3ly Phe Cy* ear flar Gly Pho Bar lie Aan Hi# Leu Asp 
"*05 410 4iS 

lie Ala Pro Ser Tyr Ala Gly 2Ie Lau Leu Giy II » Thr Aan Thr Phe 
420 435 

Alft Tnr IlB Pro Gly wet Val Qly Pro Val lie Ala Lys Sar Leu llir 

440 445 

Pro Afip Afin Thr Val Gly C51u Trp am Thr Val Phe jyv He Ala Ala 
450 4&^ 460 

Ala 21« Aan Val Phe (Sly Ala He Phe Pha Thr Leu Phe Ala Ly6 Oly 
465 470 <75 4^0 
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Glu val Gin Asn Trp Ma Leu Asn fiep Hii Has ^Jy Mi:= Arg His 

-"e^ 



<22D> 
<221> CDS 

<4Q0> 3 

ccDgggg9e^ ^ggg^cttog gcggtcccgt tggagctctc ttttccjcgg agcAg^ttCg 60 

cgccgtagct ccctgaagg<? «tc atg aag tec ccg gtt teg gac tta gee gcg ii^ 

Met 1^5 Ser Pro VaI Ser Acp Leu Al* Pro 

«9c gac ggc gag gag ggc tcg gac cgc ccg etc Ctg cag cfgc ge^ adl 
Sflr ABp Gly Glu Glu <3ly 5er Aap Arg Thr Pro Leu Din Arg Al^l 
15 26 2S 



ccg egg jcg ^aa ccc get cca g&n tgc bge cct get ogt tac aac eta 
?ro Arg Ala Glu Pro Ala Pre V*l Cys Cy^ Smr Wa Ar^ Tyr A«n Lau 
30 is 40 

gca ttt tcg tcc ttt ttt ggt ttc ttc gtt etc tat tea tt^ egg gtg 
Ala Phe Leu atr Plie Phe Gly Phe ph€ Val Leu Tyr fier iieu Axg Val 



as 7 



45 50 



55 



A»t ctg age gtt gca eta gtg gac atg gtg jac tea aae aca act gcc 305 
Aan Leu Ser Val Ala L«U Val Aep Met VaI Asp ser Asn Thr Thr Ala 
60 €5 70 

AM gat aat aga acg tee t«c gag tgt gca gag cat tct get cca ata 3S3 
Lya Asp Aan Aig Ifcr 8ar Tyr Glu €ya Ala Glu Hl« £«r Ala Pro Zla 
75 00 as 



90 



401 



aaa gtt ctt cao aac eaft aeg ggt aaa aag cac Dgg t^g gat gca gaa 
Lya Vftl Leu Hi I AM Qln Thr dly Lya Lyt tyr Afg Trp Aap Ala Qlu 

" 100 105 

act caa gga Cgg att etc g^a tct ctt ttc tat ggc tac ate ate aea 449 
Thr ain Gly trp lie Leu Gly Ser Ph« Phe Tyr aly T/r lie Ha Thr 
IXD lis 120 



7 



wo 00,^537§ 



caa act cat gga gga tat gtt gcc age aga age 959 ggg ^515 ctg ttg 497 
Gin fic Pro aly aiy Tyr val Ala Scr Arg 5Br CLy OLy Lys Lexj Leu 
125 130 125 

eta gga etc ggg ate ttt get aca get jkcc ttc «C« Ctg etc act ccc 545 
Leu Oly ?h9 Oly lie Phe Ai* Thr Ala lie Phe rhr Leu Phe Thr Pro 
l^D 245 150 

etc get gca gat ttc g^a 9tc gga gcc ctt gtt gea etc ^95 gca eta S53 
Leu Aia Ala Aap Ph« CJly VaL GLy Ala L&u Val Al3 J^t* Arg Ala L«u 
X55 160 aff5 170 

gaa ggg eta gga ga^ gtc dica tat cca gcc atg c^t ^cc atg tg9 641 
Givi Qly L«u Oly Qlu Oly Val Thr Tyr Pro Ala Met Hi^ Ala Met Ttp 
t7S 190 les 

tet tea tgg get ccc cct ctt gaa aga age aag ctt ctgf ajt att tc* tia? 
Ser ear Trp Ala Pro &ro Leu Glu Arg 5er iye L»u Lau Sar He Ser 
IM IM 2D0 

tat gca gga gca caa ctt ggg aca gta gtt tct ctt cct Ctt tct gga 737 
Tyr Ala Qly AlA 3ltt Lau Oiy Thr Val Val Ser Leu Pro L«u Ser Qly 
205 210 215 

gta att tgc tac tat atg aat tgg act tat gtc ttc tat ttc ttt ggc 7^5 
Val He eye ryr Tyv Htt Act) Trp Thr Tyx Val Phe tyr Phe Ph« Giy 
230 22& 23D 

att gtt Ate ate egg ttt att tea tgg ate tgc tta jtfc agt gat 333 
Il« Val Oly Ila lie Trp Phe He Leu Trp He Cya Leu Val Ser Aap 
2S5 240 24$ 250 

aca cca gaa act cac aag aca ate act ccc tat gaa aag gag tat att aei 
Tbr Pro alu Xlir Bia Lya Thr He thr Pro tyr 61u Lya Qlu Tyr He 
2S5 a«a 2€S 

ctt tea tea tt4 aaA aat eag etc tct tea ea^ aag tea gtg ccg tgg 329 
Leu $er Ser Lau Ly» Asn Qln Leu ^er Ser Gin Lya »er val Pro Trp 
270 ^75 260 

ata cct atg ctg aaa tea ct^ cca ctt tgg 9Ct att gtc gtt gea eat 9?7 
He Pro Hat L«u Lye ser Leu Pro Leu Trp Ala He Val Val Ala His 
2d2 290 235 

ttt tet tae aae tgg act ttt tat act ttg ttg acc tta ttg cct act 1D25 
PhB Ser *ryz Abu Vr^ Thv Phe Tyr Thr ieu Leo Thr Leu Leu Pro Thr 
30D SOS 310 



B 
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tac atg aag gaa gtc cts a^g ttc aat att caa gag aat g^g ttt eta 1072 
Tyr Met Lye Glu val Leu Arg Phe Asn lie Qln Glu Asn Gly Phe Leu 
315 32D 32S ' 33q 

tct gca gtc Cct t«c tta g^t tgt tgg tea tgt acg ate ctg teg ggt 1.121 
Ser Ala val Pro Tyr L«u Gly cya Trp i,eu Crys Met lift L*u fi^r Gly 

S^O 345 

caa get get g^c^ aftt tta agg gca aj^ tgg aat ttt CCa act ctg tgg nfig 
Gin Ala Ala Aap Aan Lbu Arg Ai* Arg Trp ASn Pha fler 1:hr Leu Trp 
350 355 360 

gtt cga aga ?ct; ttt age ctt ata ggg atg att gga cct gcg ata ttc L317 
val Arg Arg Val Phe Ser hcxi lie «iy Met lie CLy £>ro Ala lie pha 
i65 370 375 

ctg gtt gcc gca gga ttt ata ggc t$t gat tat tCC ttg get gtt gca ifitfs 
Leu V3i Ala Ala Gly Phe 3Jl« Oly Cya Aap Tyr Ser Lau Ala Vftl Ala 
380 395 350 

ttc eta ace ata tea aca ace ctg gg* ggc ttt tgc tct tet gga ttt 1313 
Phfi Lau Thr lie Ser Thr Thr teu Gly filly Phe Cy* fier &«r Gly Pi* 

40D 

«gc *tc «ac cat ctj gac «tfc get cct teg t*t get ggt att cte rtg 13« 
d«r 21a Aen HIb L*U A6p lie Ala Pro Tyr Ala Gly L©U Lsu 

415 420 425 

ggc ate aca aat acc ttt gee Act «tt cct gga atg Att ggg ccc ate 1409 
Gly He TKr Aan thr Phe AU Thr II a Pro Gly Wet 21« Qly Pro He 
43^ 435 440 

att gcc aga agt ctt acc cct gag act Att gga gaa tgg cAA *ct l«S7 
He Ala Arg sex l^u Tlir Pro Qlu Afln Thr lie Gly 6lu Trp Gla Thr 
44 5 450 4S5 

gtt ttc tgc ate get get get ate aAt gtA ttt ggt gcc att ttc tt<r ISds 
Val Pha CyB He Ala Ala Ala He A0n Val Phe Gly Ala tLe Fha Phn 
490 465 47D 

aca eta ttc gcc aaa ggt gaa gtg caa aae tgg gcc dte agt gat cmc 1SS3 
Thr Leu Phe Ala Lya Gly <3lu Val Sin Asn Trp Ala lie 5er Mp His 
475 400 4SS 4»0 

caa gga cac aga aac tgaaggaacc aatA^ataat cctgtctcta ttaatgtatc ICQ a 
Qln Qly His Arg Aiih 

4» 



wo mt]537S PCT^US99/1 6676 

ttt$cttatc atgta*ccta aaagcgcctc tgatatttta at9tgtaagc a*Cccstaca 16^6 
caagataaaa ttgMCtAga aaasctgtgt tagatttgca ag^cttgtaa cca^ga^acg 173 6 
tcaccagttg ccatataagc aaaattagct atttttaatc atcattaacc cgtctgctg^ 1789 
«at?tcaca«t tcagggtcac acatctggct gcltagtcagg Caacccacaa taggggagtt 1346 
ctatctattt ataa^ACCAt n^ttA^Agag acgagctgaa atag«cctt ccataccttt L90B 
gcttaattao ggtggacaat aattctcag? tcttgctaaa catccgtttt tgtacaccct 1968 
cctCMA*aa ttatctgtc* tcagCAHtcc ctgae*t5ta ggtctCftUAC cttagcctct 2029 
ccacggagct g^C^gec^ct gtatcattca gcctggcaac ttcactgagg gM^CAtgcc 2mB 
csg^cngctg ccacatgtcc cctctctggc tteagggaca gtgccctigca cttaggcagc 
atccaagacc aigggtcegcg ccaaggcttt ggacggtact cctcccctgg ggctgttaat Z20^ 
gftgtggatga agccctgA9<? c«M*3gga«r agcgcgatcc acagtccitgg tttccatgca 
Gcccc&ccet tcccttccca gcacactg^a gtnttgcctg 9M&gtA4CC tgcaaaagaa 
agtgtgatgc ctaattegce aeatiataaca tcatccttga tgatcctacc tticacatgga 2SaS 
tc&gagtata aatctccaag Ccctgtgttc taggagctac accagaataa tta«44tata 2448 
aaaagaaAC4 ^aacattsttc tctg&ctgac accteiagtgt cCggt&gcA^ ttcaaggcta 250 b 
aagtgacttc tacttcacat aacctgcaacr cggtggtgt* ttcttcttta gtgttgg^tt 
cttMatetE atttttceag tttCtcctgg accatrcttcc agtggttttg agcaegettt 2629 
gagggca&tt Atgtgattta gaacctgatt aatgtttc^c Cgtg&atgtt ^Nta^cCacc 26 a a 
tgt4»&at2it taactaMgc tatttaatgt aatatgatgt gtatacattc tg^aaattaa 2748 
ttt&taaatc &9taast«gc ttitaagttgc catggtgatA tttcttttM HMtcaaaat 2406 
aa^s^cttttt ggaac;ga&aa taaMaaaaa aaaaaa 2944 

c2\2> PRT 
<213> Ovia ap. 

10 
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wet Lys Ser Pro Val Ser ABp L-au Pro Ser ksp Gly Glu Giu ^3Ly 

£tr ftsp Arg Thr Pro LfeU LcU Arg Ala Pro Arg Ala Glu Pro Aia 
20 3^ 30 

Pro Val eye Cys ser Ala Arg Tyr Aan Leu Ala Phta Leu Ser Phe Phe 
^5 40 4^ 

aiy Phe Phe Val Leu Tyr Sar Leu Arg Val Aan L«u iier Val Ala Leu 
50 5S €D 

V*l ABp Met Val Aflp Sex Afin Tbr ^hr Ala Lya Asp Asn Arg Thr fier 
«S 70 75 ga 

Tyr Glu eye Ala Glu Hie Ser AIa Pro Xle Lya VaI L«u Kia Asn Oln 

95 

Gly Lys Lys Tyr Arg Trp Aep Alft Olu Thr CSln Gly Trp lie Leu 
lOQ 105 110 

Qly Ser Phe Phe Tyr oly Tyr II9 lie thr Gin Il« Pro Gly Ely Tyr 
115 ' 120 X2S 

Vftl Ala fl«r Arg fiar Gly Gly Lya Leu Leu Leu Gly Ph^ Gly Il« Phe 
130 135 14Q 

Ala Thr Ala lie Ph« Tlir Leu Phe Thr Pro Leu Ala Ala Aap Phe Gly 
H5 15^ 155 160 

Vel Oly Ala Leu Val Ala Leu Arg AlA Leu dlu Gly Leu Qly Qlu flly 
165 170 175 

val Thr Tyr Pro Al« Mat Hia Ala Nat Trp Ser Ser Trp Ala Fro Pro 
lao 195 l»0 

Lbu Glu ser i<y9 Leu Leu ser Zle fter iryr Ale Gly Ala Gin Lau 
1» 200 205 

Oly Thr Val Val Ser Leu Pro i^eu Ser Qly Val lie cya Tyr Tyr Met 
2X0 21& 

A«n trrp Thr Tyr Val phe Tyr Phe Phe Oly lie Val Gly He He Trp 
235 220 325 240 

Phe I La Leu Trp He Cya Leu val Ser Asp Thr Pro Glu Thr Hie X^y* 
245 2S0 255 

II 



Thr Ue 7hr Pzo '^yir Oiu Lye Giu Tyr lie L^ii Scr S(>t Lt^^ Lys Asn 
2^^* 26& 2^0 



Gin Leu Ser Ser Gin Lya Str Val Pro Trp lie Pro Met 1.bu l,ya s<ir 



3es 



Leu Pro L^xi Trp Ala He: Val vai Ala His Phe fier Tyt Asn Trp Thx 
250 295 laxi 

Phe Tyr Tbr Lew Leu Thr Leu Lau Pro Thr Tyr Met Ly« <3iu val Lay 

310 315 

J4rg Phe II^ <Sln dlu Asn Gly Phe Lau Ser Ala Val Pro Tyr Lbu 

Gly cys rrp Leu Cya Met He Leu Ser Oly Gin Al<i Ala A»p ksn Lto 

Ar3 Ala Arg Trp Aen PH* Ser Thr Lbu Tip V4l Arg Ax^ Val Ph* s^r 
355 3«o 

Leu lie Oly Met JIg Gly Pro Ala ne Pho Leu V«l Ala Alft flly phe 
370 375 

lie Oiy Cys Aap Tyr S4r Lau Ala Val Ala phe Leu Ttor lie Ser Thr 
3" 3^s 

rhx Lfiu Oly fliy PhB eye S«r fler Gly Ph^ fler He Aan His Leu Jks^p 

410 415 

He Ala Pro Ser Tyr AlA Gly lie Leu Leu Gly He Thr Afln Thr PhB 
^20 425 450 

Ala Thr lie Pro Gly Itet He oly pro He He AlA Arg Ser Leu thr 
4iS 440 445 



Pro Glu ABn Thr He OLy Glu Trp Gin Thr VaI Phe Cye Zla Al* Ala 



45S 4(0 



AiA lie Asn Vftl PhA Oly Ala He Phe Phe Leu Phe Al4 Lya aly 



480 



filu Val Gin Aftn Trp Ala He Ser Asp iJia Gin <lly ttia Arj Afin 

49P 495 



<2l{]^ 5 
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<2ll^ 31 
<212> DNA 

i:2aj> Artificial Sequence 

<223> Pescription of Artificial Sequence: MR primer 
<400> 5 

ogggntcccg ccngcnatgc ayrahrt^t^ g 

<aao> 6 

<212> D»A 

<2I3> Ar&ifici&I jsegu^ncfl 

<223:. Desrcripticn of Artificial Se^^ueacer PCr prinwr 
<400> 6 

e9*attccdg gdgcratfctc narrtrrCt 



<21U> 7 

<213> Komo sapienj 
<220> 

<222> t263> . . (1570) 
<400> ? 

gttoggtcga agfccctcccc ttantt*tgt gcaattcaag tffcccactgc ccgcec^caa 50 

gcccccactc atcctegct^ <9^gcagggt ggcccctjca ctttae*«gg gggt^cagge^ aSQ 

SogOTagaeg gtcgtccgaa caceggctcc ccggcatgcg tii^Accggcg ggoggagcgg XBO 

gctcactttg cgcwAtcrt fccgagaacte <rc«9iittctcc gcttccctug tca^Acccaa 240 

gccagagttg cccacaccta ag atg gcg gcg ggg gcg Jitg aca ccg ccc cgc 292 

Mat JUa Ala Oly Ala Met 'Chr Pro tro Arg 

eeg gtc Cag cca get cg^ ccc ggg ggc ttc ggg ctg teg ggc egg cgc 340 
Pro Val Gin Pro AIa Axg Pro Gly <Jiy phe flly Leu ser Oly Arg Arg ' 
15 2t) 25 

1^ 
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tec etc etc tgc C9S gtg age aca cct gcc cac gta ggc ^tc atg 
Ser Leu I^u Cys Gin Val Ala S€r Thr ?r« a1« His Val Gly Val H*t 

40 



3»B 



agg tct ccg get ega gac ctg gee egg aac gat ggc g«g gag ^cg 436 
Arg ser fro Vol Arg Aap Leu Ala Arg Aap Giv Glu Glu Ser rhr 
*5 50 5^ 

gac cge Aeg cct ctt eta ccg ggc gcc c«« egg gcc gaa gee get eCA 484 
Aap Arg Thr Pro Leu Leu Pr* Oly Ala Pro Arg Ala Glu Ala Ala Pro 

65 70 



gtg cge tgc tct get cgt t^c Aac tta gca att ttg gcc ttt ttt ggt 
v*l Cys cya Ser Ala Arg Tyr Aan Leu Ais H© Leu Al« Phe Ph« <31y 
" 0^ 85 90 



ggt aag asg tac cm tgg gat gcs gaa act eaa gga tgg att etc ggt 
6ly Uys Lys Tyr <3lii Trp Aap Ala <31u Tttr Qla Qly Trp lie L«u eiy 
"0 145 1.50 



552 



ttc tte att gtg tAt gca tta cgt gtg ««t ctg ago gtt gcg tta gtg seo 
Ph« Phfl lie val Tyr Ala X^eu Arg Val Asn Lea Ser val Ala Leu Val 

gat atg gtft gat tea Mt aca act tta gaa gat aat uga act Ccc aag 
Aap Met V»l Aap Sttx Asn Thr Thi Leu tJlu Aap Ajm Arg rhr Sar r.yj 
110 115 lao 

gcg tgt cea gag cat tct get ccc at« aaa gtt eat cat Mb caa acg «76 
Ala Cya Pro <Jiu Hia 5«r Ala Pro Ila Lya val Hia Hl« Aan Glr ihr 

130 135 



?24 



772 



tec ttt ttt cat ggc eac ato ate aca cag att cct gga gga tat gtt 
ser Ph« Pbe ryr oly Tyr lie lia Thr Qla Ue Pro Qiy oiy ^yy val 

l«D les 170 

gcc age aaa ata ggg ggg aaa atg ctg eta gg« ttt ggg ate ctt gge 
Ala ser Lya lie aiy flly Lys Hst Lau Leu Qly Phe Qly He Leu «y 
ITS IBO 185 

act get gte etc aee ctg ttc act ecc att get gea gat tta gga gtt 
Ala Val Leu Thr Leu Phe Tlir Pre li© Ala Ala Asp Leu Gly Val 
190 155 200 

gga e«a etc att gca etc aga gca eta gaa gga eta gga gag ggt gtt sis 
<lly Pro Leu lltt Val Leu A*g Ala Leu Slu CSly Leu Sly Glu sly Val 
2<J5 210 

14 



820 



6B8 
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aca ttt cc* gc^ «tg cat gcc atg c^g cct tct t^g get coc ccc ctt 964 
Thr Phe Pro Ala Met His Ma Met Trp Scr Scr Trp Aia Pro Pro Lbq 
220 225 

gaa *c,a age «aA etc ctt age att teg tat gcA gga gc« cag ctt gg^ id12 
Glu -<0:g Ser Lya Ley Leu Ser 31e 5er Tyr Ala Gly Ala Gin l^y GLy 
235 240 

aca gca att tct ctt cct cct Cct g^A jica act tgc Cac tat acg aac aO€0 
Thr Vfil lit Ser Leu fro Leu ser Giy rie He cy3 Tyz Tyr W^t Aan 

26D 

tgg act cat gtc ttc tac ttc ttt gjt «ec att gga ata tct tgg ttc iioe 
Trp ^hr Tyr Val Phe Tyr Ph« Phe Qly Thx lie 01 lie Phe Txp Phe 
270 275 330 

etc te^ tgg tgg tta gtc agt gac aca cca caa Aaa cac aa^ aga iisff 
J^eu L«u Trp lie Trp liQu Vai Ser Aap Thr Pro .Oln Lys Hia Lys Arg 
2^5 295 

att tec cat tat gAft ftug gaa tac ate ctt tea tea tta aat ca^ 1204 
He S^r Uis Tyr Qiu Lya Olxj Tyr 5le Leu Bar Ser Leu Arg Asn Oin 

300 aoa 310 

ctt tct tea cag aag tea gtg CCg tgg gta ecc ttt tta aaa tec ctg 1252 
Leu Ser Ser Gin Lya 5er V*l Pro Trp Val Wo He Leu Ly< fl»r Leu 
315 320 32S 33Q 

cca ctt tg^ get ftte ^ta gtt gca cac ttt cct tac aae tgg act ttt 130D 
Pro Leu Trp Ala lie Val Val Ala Hid Phe Ser Tyr A«a Trp Thr Fha 
33S i4a 34^ 

tat act tta ttg ac« ttA ttg cct act tat Atg aag gag *tc eta agg 1346 
TVr Thr Leu L»u Thr Lau Lau Pro Thr Tyr Mat Lya Qlix llm Lau Arg 
350 155 3€0 

ttc aat gtt caa gag aat ggg ttt tta tct tea tt^ cct tat tta ggc 1296 
Phe tksn Val 91ci Qlu Asn Gly Phe Leu tfar 5©r Leu Pro T/r Lau Gly 
365 37Q 375 

tct tgg tta tgc atg ate ct^ cct ggt eaa get get gac aat tta agg 1444 
S«r Trp Lau Cya Nat lie Leu Ser Qly Oln Ala Ala Asp A*tl Lau Arg 
3S0 395 3»0 

BQR aaa tgg aat ttt tea act tta tgt gtt cgc aga att ttt age ctt u^s 
Ala Lys Trp Aan Phe Ser rhr Lev* Cys Val Arg Arg Xla ?he Ser Leu 

395 400 4D& «10 

1& 
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ata 99a ac9 «tt gga ctt gca c?ca etc ctg gta gcc get 590 ttc att 
He GLy Met lie Gly Pro Ala val Pl>e Leu VaJl Al^x Ala Gly Phe rie 

<120 425 



1540 



ggc tgt gAG cat tct ttg gcc 9tt get ttc eta act ata tea aca aca 1566 

Gly eye Aap Pyr S«r Leu Ala Val AlA Phe L^u Th^ lie SEr Thr Thr 
^30 4iS 440 

ctg gga ggc ctt Cgc tct tct gga ttt age ate Mc cat ctg gat att ifiis 

Leu Gly Qiy The Cys Ser Ser «y ?he Ser Il« Aan Kla L«u Agp 3i« 
4*3 4S0 455 



get cct tog tat get ggt atc etc ctg ggc atc *ca aac Aca ttt gcc 
Ala Pro Ser Tyx Ala Qly ll« Lou Lea Gly He Thr Thr Phe Ala 
4tfD 465 470 



act ACt cca gga atg gtt ggg ccc gtc atc get aaa agt ctg acc cct 1733 
Thr 11 R Pro Gly Met Val Oly Pro Val Il<t Aia Lya Ser Lbu Thr Pxo 

4«S 450 

gat aae act gtt gga gaa tgg caa acc gtg ttc tAt act get get get 1780 
Aap A«n TUf Val Qly <Slu Trp Qln Thr Val Phe Tyr Ha Aja Ala 
455 5U0 SOS 

att oat gtt ttt ggt gcc att ttc ttt ac« eta ttc gcc «aa ggt gaa i«e 
XJ.O Aan Vftl Ph* Gly Ala Ila Pte Phe mr Leu Ph« AJ.41 Lys Gly fllu 
51& 515 S20 

gca cAa 43c tgg get etc aat gat cac cat gga cac aga cac ag?D 
Val Oin Asn Trp Ala Leu Aan A^ Hia kia <5ly Kia Arg Hl« 
525 53d 53g 

tgaaggaacc aataaataat cctgccteta ttAatgtatt ttt att tat c atgtAacctc i&30 

aaagrtgcctt ctgtattgtg taegcattct atgtcttttt ttaattgtw Ctgtattaga 1590 

tttttaaggc ctataatcab gaaatatcac tagttgccag aataaCMaa tgaectgtgt 2DW 

tt4attatga ataatatgta agctaggaet tctactttag gttcacatae ctgcctgeta 2110 

gtcgggcaac atgaagtagg acagttctgt tgatttttta gggccatAct aaagggaatg 2 no 

agctgaaaca gaccccctga taCctttgct taattaaact agatgat*at tctcaggtac 22 3D 

tgataaacac ctgttgttgt t cac ttt cct cataaaaatt gtcagctctc tctgacactt 2290 

a93crtcaaa ctttagcatc tctgtggagc tgccacccac tgtataattt cgcctggcaa 2350 

16 
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ctggaccgag sgS*gtgtgc ccaggcagct gccaAgcact ccccccccgg cttc«gggtc 24io 
sgagtgcccA gcgtttatca gaggcagcdt ccaagcccag agcc .^tgtc gactcttcgg Z^io 
ctggtgcctt tcctctgagg ggcCfttcasc gtgt^gataa agccctgagt aggc^agagc 2520 
agtgagatcc actgctatgg tcttgatAca tcctcaaact ctccettccc agcac^gugg 2S50 
aatattggct ggcatgea«c ctgcaaaaga Anaatgcgaa ^cggccgggc acggtggctc 2SS0 
acgccegeaa tcccagcact te^gggggct gaggtgggcg astCAtgaga tcag^«gttc 2 7io 
gagaccagcc CSfjCCA^MC ggtgaaaccc CUtctctact aaaaatacaa ftnaactagct 2770 
gggcgtggtg acgggcgcet gtMtcccag atactcagga g^ct^aggca ggag«tcac aajo 
ttgaacctgg gaggtggaag ttgcagtgn.a ccaagatcac gccactgcac tccagcctgg 2fl50 
gcga engage gagactccaa ctcsaanaaa aaaaaaaaaa 2 Sao 

<2ll> 53€ 
<312> PRT 
<217> Hon*Q aapiena 

<400> 0 

Mat Ala klA QXy AlA Wat Tlir Pro Pro Argf PXd Vai Gin Pro Ala Arg 
1 5 Iff 15 

Pro Qly Gly Phe Gly Leu 5er Qly Ars Arg Sar Leu Leu Cyfli Qin Val 
30 2& ^0 

Ala Ser Thr ^ro Ala Hia Val {51y V^l Met Ar^ d«r Pzro Val Arg Aip 
33 40 4S 

L«u Alat Arg Asn Aap Qly aiu Slu der xhr Aap Arg ThT Pro Lau Lau 

SO £S 60 

Pre OLy Ala pro Arg Ala Qlu Ala Ala &ro Val Cya cya sar a) a Arg 
65 70 75 fto 

Tyr Mn 1*11 AIa iXe L«u Ala Pha Ph© CSly Pha Pha lie Val Tyr Ala 
95 9D 95 

Leu Arg val Aon I^u 5er Val Ala Lau Val Aap Mac Val Aap aar Aan 
lOQ 105 IID 

1-? 
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Thr Thjr Leu GJu Asp A3n Ar^ Thr S«r Lys AlJi Cya Pro GXu Hia Ser 

120 125 

Ala t»TD lie LyB Val Lis His A«n Cln Thy Gly l.ys Lys Tyr Gin Trp 
ISO 13S iflO 

Asp AlA Glu Thr Qin Gly Trp lie Leii Qly S*t Phfi Phu Tyr CJly Tyr 

150 15S 

lie ll« Thr <Jlrt lie Pro Gly CJly Tyr Val Ala 5er Lys tie Gly Gly 
1^5 170 

Lye Net Leu L«tJ <Jly Phe Qly ll« Leu Gly TKt Ala Val Lau Thr Leu 
180 18S 15Q 

?ha Thr Pro He AlA Ala Aep L^u Qly VaL Gly Pro Leu 11© Val L»u 
1»S 20O 205 

Ar^ Ala Lbu Gla Gly Leu Oly Glu Gly Val Thr Ph« Pxo Ala M«t Hi» 
210 2l£ 320 

Ala Mat Trp fler 9«r Trp Ala Pro Pxo Leu Glu Arg fier Ly« Leu L«u 

225 aaa 235 34^ 

ser 21a Ser Tyr Ala Qly AlA C31n Leu Qly Tht Val He flar Leu Pro 
245 aSD 

Lbu ser Gly He 11^ Cya Tyx Tyr Wet Asn tip Thr Tyr Val Pbe Tyr 
250 368 270' 

Phe Phe Sly Thr rle gly lie Phe Trp Phe Leu lau Trp He Tip Lau 
27^ SaO 

V^I Ser Aep Thr Pxo Qln Lys Mt9 l>y* Ai^ He &er Hia Tyr Olu Lya 
390 295 200 

Olu Tyr He i<u 5er Ber Leu Arg A^a Oln L«u Ser $er Gin Lye s«r 

310 il5 

Vtl Pro Trp Val Pro He Leu Lye ^er Leu Pro Leu Trp aIa He v«i 
Z25 330 

Val Ala Hia &he 3er Tyr Asi) Trp Thr Phe Tyr Thr Leu Leu Thr Leu 
«0 345 350 

Leu Pro Thr Tyr Net lyi olu Ha Leu Arg Phe Aftn Val oln Olu Aen 

360 d«5 

IB 
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Cly The Leu ser ser 

Leu aiy <51n Ala 

Thr Leu Cye v^l Ax^ 
405 

A^A Val Phe Leu 
420 

Alai VaL Ala Ph« L<u 

Ser qiy ^he ser lie 
450 

lie Leu Leu Oly lie 
4G5 

Qly Pro Val Zle Al* 
4dS 

rrp oln Thr Val £>he 
500 

lie Phe Phe Thr Leu 

Aan A^p Hia His 21y 
»3P 



Leu Pro Tyjr Leu dly Ser 

Ala Asp A^n Lfiu Arc? A2a 
390 IBS 

Arg Ilia Phe 5er Leu He 

Ala Ala Qly Phe lie Gly 
425 

Thr He S«r Thr Thr Leu 
440 

Asn Hia Leu Asp lie Ala 
4 55 

Thr Afln Thr Phe Al* Thr 
470 47S 

Lya ser Leu Hir fro Asp 
4»0 

Tyr 11^ Ala Ala AXa lla 

505 

Phe Ala Lya Qly Olu Val 

UU Hia 
53& 



Trp Cya M«t tie 

Lys Trp A^n Phe str 
4D0 

Oly wet lie i3Ly e>ro 
41S 

Cya Aap Tyr Ser Leu 
430 

<31y Gly Ph« Cys Ser 
445 

Pro fler Tyr Ala Gly 
460 

31e Pro Gly Met Val 

4ao 

Asn Thr v*l Qiy gIu 
4»S 

Aan Vecl Phe Gly Alu 

Gin Aan trp Ala l^u 
325 



<2I0> 9 
<311> L4aS 
c212> DMA 

Artiflti^ial Sequanae 



<223> lie script Ion of Artificial Sequences human /oheep 
conceiiBus aequeQC^ 

<220> 

<221> CDS 

•c:222> a) [14as) 



1^ 



RCTrtJSWl6676 



<10D> 9 

a eg *rg tcy ccg gtt ysr gac ytr gcc «g «rc ^fty ggc ga9 g«g r^c 43 

Met Xaa Xaa Pro 7al Xaa Aap Xaa Aiet Xaa Xaa Xaa Gly Glu Giu Xaa 
1 10 15 

v'cg gac cgc «cr cck cty ctr crng egc gcc ccr <^3g gcB guji sec get ^6 
Xaa A5p Arg Xaa Xaa X«a Xaa KaA Xaa JLLa Xaa Arg xa^A Glu ;caa Ala 
30 25 30 

cca gtr Cgc tgc tct get egt t«c sac yta gca wtt ttg kcc ttt ttt 144 
Pro Xa* Cy5 Cya Ser Ala Are Tyr Aan xaa Ala xaa Lbu Xaa Fhe Ph< 
35 *0 45 

g^t ttc ttc rtt etB tat kCA tea cgJc gcg aat ctg *gy cfct gcr yta 152 
<3Iy Pbe Phe XHA XM Tyy Xaa Leu xaa Val Asn L«U Xaa Val Xaa XM 

50 35 €D 

gtg gay at^ gtr get tea ft^y aca act kyra raa gat Mt aga ack tec 24Q 
VaL Xaa Net Xii«i Afp Ser Xaa Thr Thr Xaa Xaa .ASp Aazi Arg Xaa S«r 
tfS 70 75 ao 

va« gmg tgt aca gag cat tct get ccc ata aaa gtt cwt cay nay caa 3a e 
Xaa Xaa Cys Xaa ola hi a ser Ala Pj>o lie xya v^l xaa xaa Xaa CILn 
e& 90 95 

acg ggt aar aa? tac err: tgg gat gca gaa act caa gga tgg att etc 336 
Thr Gly xaa Lys Tyr Xaa trp Asp Ala QIm Tht Oln Sly ^rp lie Laii 
lOD 105 110 

gg>f tcy ttt tty tat ggc tac at« ate aca car att cct gga g^a tat 3B4 
Xaa Xaa Pha Xaa Tyr Qly Tyr lie lie Thr xaa He Pro Gly Oly Tyr 
115 ISO 125 

gtt gcc a^c ara akw ggg ggg aar mtg ytg eta gga tty ggg ate ytt 432 
VaL Ala dar Xaa Xaa Gly Qly Xaa &aa Xaa Lau Gly Xaa Gly Il« Xaa 
130 135 140 

gay aew get rto ytc acc ctg ttc act cee mty get gca gat ttm gga 4dO 
Xaa Xaa Ala Xaa Xaa Thr l#SU Phe Thr Pro Xaa Ala Ala Asp Xaa Qly 
145 ISO l$5 ISO 

B^Y Cty rtt gya etc agr gca eta ^aa ggr eta gga gag ggt 520 

Xaa Gly Xaa Xaa Xaa Xaa Leu Xaa Ala Lau Glu xaa lieu Gly Glu Gly 
t€S no 175 

gty aca twt cca tfCC atg eat gcc atg tgg tct tew tgg get cee cct 576 
Xaa Thr Xaa Pro Ala Mat Kls Ala Hat Trp Ser X« Trp Ala Pro Pro 
U<1 IBS 190 

20 
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Ctt gaa aga asc aar etc ctk Ajy att tcr tat gca gga gta car ctt 634 
Leu <3iu Arg Ser Leu Xaa Jtaa lie Xaa Tyr SiJlh Gly Ala Xaa Lmu 

t9S 200 205 



Sgg aca gta rtt tcb ctfc cct ctt tct gga rca att tgc tac t«t atg 
Gly Thr Val Xajj Ser Leu pro Lau aar Oly Xaa He cy5 Tyr Tyr H«t. 
iJ-^? 2L5 220 



612 



aat t$g «cc tat gtc tt'c tAy tty ttd ggy ayC rtt gga atm wty t^g 750 
A«n Trp Thr Tyr Val Pht Xaa Xaa Phe Xaa XAa Xaa Gly Xaa X^A Trp 

230 235 34,5 

ttt met ttr tgg ate tta gtt agt gay ac« cca aaa cac aag 76e 

Pho Xaa Xaa Tip lie Xaa Leu Val Ser XA« Thr Pro XAa Xaa His liyu 

a*ft Aty wcy cmy tat ga<t na^g gar tay att ctt tea tea tea arih aat 916 
X«a Xaa Xaa xaa Tyr Qiu Lya Xaa Xaa J;I« L«u Sex S«r Leu Xaa Abu 
250 265 27D 

cag ccy tct tea cag aag tea ^eg ccg tgg rta ccy atk ytr 4iaa tetn 
Gin Xaa Ser Ser cSln Lya Q« Val >ra Trp xaa xaa Xaa Xfta Lya xaa 

ctg cca ctt tB9 9ct aty gtm gtt ge« cay ttt tct aac tg^ «ct 912 

LBu Pre Lttu Txp Ala Xaa xaa Val Ala Xaa Phe ser Tyr Asn Trp Thr 
390 295 ino 

ttt tat act ttr ttg acm tta ttg cct act tay at 9 aag gar rtc at A SfiO 
Tyr thr xaa hto xaa Ltfu Leu Pro Thr Xaa Mat Lys Xaa Xaa 
SID 315 

agg tte aat rtt caa gag aat ggg ttt tta tct kca kta cct tat tta 10 &a 
Arg ?htt Aan xaa Gin Olu Ain 01 y Phe Lau Sar Xaa Xaa Pro T3rr lieu 
325 i30 335 

ggy tat tg^ tta tgt atg ate ctg tck ggt caa get get gac aat tta X05€ 
xaa Xaa trp ieu Cy« Met Il€ Leu 3Laa Gly Gin Ala Ala Aap Aau l«u 
340 :k45 

3gg gca ara tgg aat ttt tea act ytr tgk gtt cgm aga xtt ttt age 11 g 4 
Arg Ala Xaa trp Asn Pha Ser Thr Xaa Xaa Val Xaa Arg Xaa Ph« Ser 

l«a :365 

Ctt ata ggr acg att gga cdt gcr rta ttc ctg gtw gey gcw ggtti tty 115a 
LBU ila Xaa Mat 11b Gly pro xaa xaa phe teu Xaa Xaa Xaa Xaa x a 
370 US 3Q0 



71 
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atw ggc t^t gac tat tcy ttg gey get gcw ttc eta acy ata tea aca i^DO 
Xaa Gly Cya ABp Tyr Xna Leu Xaa Vai xaa Phe Lu^i xaa Il« 5er Thx 

39Q 355 400 

acm ctg gga g&c ttt tjc tct tct gg* ttt age ate aac cat ctg gay 124B 
Xaa Cly Gly Phe Cys Ser ser Gly Phe S^r tie Aan kIb Leu Xaa 
405 410 415 

ett get cct teg tat ^jct ggt acy etc ctg ggc ate aca aac e^cm ttt 12 96 
lie Ala P>ro S<tr Tyr Ala GLy JCa» L^u Lou Gly rle Thr A«n Xaa Phe 
420 <82S 

gcc *ct *tt ccw gr9« atg rtt ggg ccc ttc att gey ara agt ctk acc aj44 
Ala Thr ria Xaa Oly Met Xaa <3ly Pro Xaa He Xaa X»A S«r Xaa Thr 
435 440 44S 

ccc ga>c aac act rtc gga gaa tgg <*a« acy gtfc ttc try aty get get 159a 
Pro Xaa Asn thr Xaa Gly Glu Trp Oln Xaa xaa Phe Xaa Jlaa Ala Ala 
450 4C0 

get aty aat gtw ttt ggt gcc att ttc tty ac« eta ttc gcc aaa ggc 144£> 
AlA Xaa Asn xaa Fba fliy Ala lie Phe Xaa Thr Lau Pha Ala t^ya aiy 
465 470 475 4fi0 

gaa gtr caa aac tgg gey ffltc art gat cac aaw gga cac «gA raac 14fiS 
Olsx Xaa Gin Asn Trp Xaa Xaa Xaa Asp Hi a Xaa Qly mis Azg Xaa 
495 4^5 



<2lO> 10 
<211> 455 
c2l2> PUT 

<213> Artificial ^aquan/Ca 
<40d> ID 

Net Xaa Xaa Pro V&l Xaa Aap Xaa Ala xaa Xaa xaa Gly QlVk Ql\t Xaa 
15 10 15 

Xaa Aap Arg Xaa xaa Xaa Xaa Xaa Xaa Ala Xaa Ar^ Xaa Glu Xaa Ala 

20 25 3Q 

Pro Xaa Cys Cys Ser ^a Arg Tyr A&n Xaa Ala Xaa Leu Xaa &he Phe 
35 40 43 

Gly PhB Pha xaa xaa Tyv xaa Leu Xaa Val Asn Lau Xaa Val Xaa Xaa 
50 55 ^0 



23 



BNSOOCID: <WO 0005375AlTt > 



wo 0Q^n5375 



Vaj. xaa Wet Xaa Asp ser X^a Thi Thr XAa Xaa Asp Aan i^rg Xaa Ser 
6& 70 75 

Xaa X*xt Oyt Xaa CSlu Wi^s Ser AI^i Prci lie lys Val Xaa Xaa :iaa Oin 
as 90 9s 

Thr Qly Xaa Lys Tyr Xaa Trp Asp Ala Olu Thr Gin Qiy Trp zie Leu 

100 105 aao 

xaa xa« Phe Xa^t TVr CLy Tyr He He Thr xaa lie Pro Gly giy tyr 
lis X20 125 

Val Ala Btt XAH Xa« Qly Gly Xaa Xaa Xaa Leu Gly XAa Gly He Xaa 
IJO 135 140 

Xaa Xaa Ala ^aa Xa^i Tbr Leu Phe Tlir Pro X«a Ala Al* Aap xaa <3ly 

150 ISS ISO • 

Xaa Gly Xaa Xaa Xaa X^^ Leu Xiia Ala Leu C?lu Xaa I/eu gly Glu Qly 
165 I7t> 

Xaa Thr Xa« Pro Ala Mat Hia Ala Wet Txp flar Xaa Trp Ala Pro ?Xo 

ifio les • 1^0 

Lau Glu krg 5er Xaa teu Xaa Xaa lie xaa Tyr Ala Gly Ala Xaa Leu 
i95 20O 20S 

oly Thr Val Xaa flar Leu Pro Leu Ser <31y Xaa 21e cys Tyr Tyr Net 
210 215 22a 

Aan Trp thr Tyr Val Phe Xaa Xaa Pha Xaa xaa Xaa Qly xaa Xaa Trp 

2Z0 235 24D 

Phe Xaa Xaa Trp 31e Xaa Leu Val £er Xaa Thr Pro Xaa Xaa Hia Lya 

250 2&S 

xaa xaa XM Xaa Tyr aiu tys Xaa xaa ne Leu Ser fler Leu XJia Aan 
260 265 27a 

Gin Xaa Ser £er Oin Lys S^r Val Pro rrp Xaa Xaa Xaa xaa Lya Xaa 
27S 2fiO 265 

Leu Pro Leu Trp Ala XM Xaa Val Ala Xaa Phe Se* Tyr Aan Trp Thr 
290 300 

Phe Tyr Thr Xaa Leu Xaa Lau Lau Pro Thr xaa Mee Lya Xaa Xaa Leu 
^05 aiO 315 320 



23 
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Arg Pho As:i xaa Clo Glu Aan Gly Pha Ley Ser }C«A Xaa tro T>,^ l«u 
32S 

xaa Xaa Tip Leu Cya Met ll« Leu Xaa Gly Girt Aia Ala Asp Asn 
340 345 35c^ 

Arg AI* XAa Trp Asr Ph» Ser Thr Xaa Xaa Val Kaa Arg Phe S«r 

36Q 

l.fiu n« Xaa Met He Gly Pto Xaa Xna ^he l^u JC** Xaa X^a Xaa JXan 
370 375 

Xaa QXy Cys Asp Tyr Xaa Leu Xaa val Xaa phe Lei> Xaa ll« s^r Thr 

355 

Xaa beu (Sly Oly Pba Cya Ser 6ex Gly Ph* Qcr lie A»n His i^u Xaa 
405 410 43^5 

lift Ala Pro sex Tyr Ala <Sly Xm Lbu Leu <3iy lie Thr Asn Xaa Phe 
420 455 

Ala Thr lie Xaa flly Met Xm flly Pro Xaa lie Xaa Xaa Ser XAa Thx 
43S 440 44S 

Pro Xaa A»n Thr Xaft «ly Glu Trp flln Xaa Xaa Phe Xaa Xii« Ala Ala 

455 4ffO 

Ala K«a Asn Xaa Phe Gly Ala He Phe Xaa Thr ieu Ph« Ala Lyn aiy 

«5 470 475 4eo 

Oiu Xaa Gin Aan Typ Xa4i Xaa xaa Afip die Xa^ Oly Kia Arg Xta 
485 4^^ 



<Z1Q> 11 
<2I1> 14«5 
015> DKA 

<2l3> Artificial Seguenoe 

<223> Deacriptiao of Artificial Sequence* human /ebeep 
canaencua aeguence 

<3^1> CDS 

<2Z2> (1) . , {1405) 

<4D0> II 

24 
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atg ^r>g ten ccg get nnn g3.c ntn gcc cng *nc gem qgc ga^ 
Mat Xft^ Pro Val xaa Asf^ Xda Ala Xaa Xaa X«a Gly <;iu Glu Xaa 

15 XQ 



lid 



1* 

ncg gac cgc acn ccn ctn ctn cng ngc gcc Cot egg gen gaa ncc get 96 
X«a Aap Ax^ X*« Xaa Xaa Xa3 Kaa Xa& Aia Xaa fhrg Xaa OXu Xaa Aia 
20 2S 30 

cca gtn tgc tgc tct get cgt tac aa< iita gc^A jitt ttg iicc etc ttt U4 
Pro X«A Cys cya 5«r Ala Arg Tyr A«rt XAA Ala Xaa L»u X»a E>he PJie 
3& 40 45 

ggt ttc ttc ntt ntri CAt nca tta cgn gtg aat ctg agn ^tt gen ntft 192 
Gly Ph8 Phe X»a Xna Tyr Xaa Leu x«A Val Asn Leu JCaa Val xaa Xte 
50 SS fio 

gtg gan atg gtn gat tea ten aca act nnn naa gat aat n^* acn tec 240 
VAi Xaa M*t Xaa Aip Ser Xaa Thr Thr Xaa Xaa A«p Afcn Arg Xaa S*j- 

nan gng tgt nca gag cat tct get cec ata A4« gtt eiit can Mci caa 289 
xaa xaa Cya Xaa Glu Hlf B^z Ala Pro Ild Lys Val Xaa Xaa Xaa <Sln 
»5 M 95 

aog 9gt aan aag tac cnn tgg gat gca gaa act caA g^a tgg att etc 3i6 
Tlir C51y Xaa Lya Tyr Xaa Asp Ala Glu Ohr Oln Oiy Trp lie L«u 

IDS 110 



ggn ten ttt tta tat gge tac ate ^tc Aca can att cct gge g^a tat 
XAA XAA 9hic XAA Tyr aiy Tyr rle He Thr Xaa He Pro gly Oly lyr 
115 i2g 125 



384 



gtt gcc age ana aim ggg ggg «an ntg Rtg ctA gga ttn ggg ate ntt 432 
val Ala ^«r XAA JCAA Oly 31y Xaa Xaa Xaa Lao flly xaa Gly Ha Xaa 
130 1%S i«o 

gun aen get ntc ntc ace ctg ttc act ceo ntn get gem 9a t ttn gga 4 80 
Xaa laa Ala Xaa Xaa Tfet L«u Pha l-hr Pro Xaa Ala Ala Asp xaa Oly 

ISO ISS ISO 

gtn gga nen atn ntt g»A etc agn gca eta gaa gga eta gga gag ggt 528 
Xaa Gly Xaa Xaa Xaa Xaa Leu Xaa Ma Leu Qlu Xaa lbu Gly Glu Oly 
165 170 175 

gt« AeA tne cca gcc atg cat gcc atg tgg tct ten tgg get cec cct 57€ 
Xaa Thr Xaa Pro Ala Met Hie Ala H«t Trp Ser xaa Trp Ala Pro Pro 

xaa 105 190 



25 
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cct gaa aga age aan ctt ctn agn ate ten tcit gca gga gcA can ctt 624 
Leu Clu Arg ser Xaa Leu xea xaa He Jtaa Tyr Ala Oiy i^ja ^aa Leu 
195 200 2D5 

g9g aca gca ntt tct ctt cct Ctt tct gga nca att tgc cac tat atg tf72 
Gly Thjf v^l Xae Sor Leu Pro Leu S*r Gly Xaa He Cya r>'r Tyr Mat 

aat tgg «ct tAt gtc ttc tan tttt ctt ggn ant nte gga atn atn tgg 72D 
Aan Trp Thr Tyr Val Ph* Xaa Xaa Pha Xaa Xaa XAa Gly XA* Xaa Trp 
2aS 32D 22B 240 

ttt iitc ten tgg ate tgn tta gtL agt gan aca cco naa ann cac a«9 7^8 
Phe xaa Xaa 7rp Il« X«A Leu val ser Xa^ Thr Pro xaa Xaa His LyB 
245 250 255 

ana atn ncn cim tat gaa aag gan tan att ctt tea tea tta ana aat 816 
Xaa Xaa xaa xaa Zyt Qlu Lya Xaa Xaa lie htxi Ser Ser Lau xaa Aan 
290 ses 2?d 

cacf ctst tcb tea cag aag tea gtg ccg tgg nta cm 4tn ntn a«a ten 064 
^lA Xaa flar Ser Gin S«r Val Pro Tzp Xaa Xaa Xaa Xaa Lya Xaa 
27S 2aO 285 

ctg cca ctt tggp get; atn gtn gtt gCA can ttC cct tac aac tgg act 912 
Leu Pro Lau Trp Ala xaa xaa val Ala Xaa Pha aar Tyr Aan Trp Thr 
290 29^ 3aD 

ttt tat acc ttn ctg acxi tta ttg ccc act tan atg tag gan ntc eta 
PhB Tyr Thr Xaa Leu Xaa Lau Lau Pro rhr Xaa Met tya Xaa Xaa Leu 
^05 310 3aS 3^0 

agg ttc aat net caa gag aat ggg ttt tta tct nca ntn cct tat tta id 05 
Arg Pha Aan xaa* Gin glu Aaa Oly Phe Xfflu Sar Xaa Xaa Pro Tyr Lau 
125 aid 335 • 

ggn tnt tgig tta tgt atg ate ctg ten ggt caa get get gac aac tta 1056 
xaa xaa Trp Leu cys 09et Xle Xaa QLy Oln Ala Ala Aap abh Lau 
MO 3^5 350 

agg gca a^a tgg aae ttt taa act ntn tgn gtt ogn aga ntt ttt age 1104 
Arg Ala Xaa Trp Aan Pha dar Tlir Xaa Xaa Val xaa Arg Xaa Phe 5ar 
155 3«0 

ctt ata ggn atg att gga cce gen nta ttc ctg gtn gen gen ggn ttn 1152 
Leu lie xaa Hat llm Qly Pto Xaa Xaa Phe Leu Xaa Xaa Xaa Xaa Xaa 
370 175 380 
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sen ggc tet gat cue ten ttg gen gtt gen ttc eta acn ata rca ACa i20D 
Xaa Qly Cys A£p r>fr Xiia L*ti Xas Val Xaa Phe Leu X&a lie £er Tbr 
2tS 290 395 400 

acn etg gga ggc ttC tgc tet tct gga ttt «gc fttc aac c&t ctg gein 1246 
XftA Leu Gly Oly Phe eye Ser Sex Cly £^he Ser He Asn Hia X«u Xaa 
■105 410 41S 

ate gt?t cct teg tat ^cc ggt atn ccc ctg ggc «tc aca aat acn cct 1295 
lie JU-a Pro S«r Tyr Ala <3ly Xaa Leu beu ^iy lie thr A<» Xaa Phe 
420 425 430 

gcG act att CCD atg ntt ggg ccc ntc atC g«n ana agt ctn acre 1:144 
Ala Tbz He :Caa Gly Hen Xa4k Qly ^ro Xna He Xaa :xaa S«r Xaa l^r 
4:)^ 44Q 44S 

cct gan MC ftct ntt gga gea tg^ caa acn gtn ttc tnn atn get get 1352 
Pro X4a Aan Thr Xaa Gly 01 u tip Gin Xaa Xaa the X»a Xaa Ala Ala 
4S0 4S& 460 

get atn aat gttt ttfc ggt gee att btc ttn aca eta ttc gcc aan ggt 1440 
Ala Xaa A«n Xaa Phe Gly Ala 2X« &he Xaa Lau the Ala l;y8 Gly 

47Q 475 4B0 

94a 9tn caa aae tgg gen ntc ant gat cmc can gga cac 49a nac i«85 
aia Xaa Gin Asn Trp XAa Xaa xaa Bia Xaa Qly Hia Arg xaa 
4eS 4»0 495 



<aiO> 12 
<212> PRT 

i:2l3> Artificial eequence 
<4O0> 12 

Hat Xaa Xaa Pro Val Xaa Asp Xaa Ala Xaa Xaa Xaa Oly Qlu Cllu Xaa 
13 10 1& 

Xaa Asp Arg Xaa xaa Xaa Xaa Xaa xaa Ala Xaa Arg Xaa Glu xaa Ala 

20 2$ 30 

Pro Xaa Cya Cya Ser Ala Arg Tyr Aan Xaa Ala Xaa Lsu Xaa Fhe Ph« 
3& 40 4$ 

^ly Pha Pile Xaa Xaa Tyr Xaa Lau Xaa Val Aan Leu Xaa Val Xaa xaa 
50 55 €0 

val xaa Hat Xaa Aap s«r xaa Thr thr Xaa xaa Aap Asn Ar^ Xaa der 

27 



wo 00/05375 



PCT/US9a/]4676 



«5 70 75 



8d 



X*^ Xaa CyE Xaa Glu His Ser ALa Pro lie Lya V<tl Xaa X«a Xaa am 
^5 90 53 

rhr 01 y XAA Lys l^rr xa* Trp Asp Ala Qlu Ihr Gin Giy Trp He Ue^j 

105 

Xaa Pha tea Tyr <3ly ryr lie 11* Thr Xaa ll€ Pro «y «y ryr 
115 120 

Val Ala Ser Xaa Xaa Oly Oly Xaa Xaa Xaa iou <Sly XAA Gly !!« xaa 

lln 133 

XAA Xaa Ala Xaa Xaa Tte Leu Pha Tlir Pro Xaa Ala aIa Asp icaa Gly 
i« «^ 155 ifio 

Xaa giy XAa Xaa Xaa xaa Leu Xaa Ala Leu olu Xaa Leu <3ly Clu oiy 
ifi^ 17a 175 

Xaa Thx ]taa Pro Ala iJia Ala if^t Tip Ser Xaa Trp Ala Pro Pro 
A«l> 185 

Leu Oiu Ars S*r Xaa Leu xaa Xaa He xaa ryr Aia Sly aIa Xaa mu 
195 200 205 

Oly Thr Val xaa ser LpAu Pro Leu 5er Oly Xaa lie Cya Tyr Tyr Mat 
210 215 250 

Aan Trp Thr ryr Val Phe xae xaa Pha Xaa xaa xaa Gly xaa Xaa Trp 

Phe Xaa Xaa Trp lie Xaa Lau val ser Xaa Thr Pro Xaa Xaa His l^yg 
245 250 255 

XAA Xaa XAA Xaa Tyr cjXu Lya Xaa Xaa lie L*u Bar aei i*eu Xaa Asn 

2$5 270 

Oln XAA Sar Sar ain Lya ser val &ra Trp Xaa XAa Xaa Xaa Lya Xaa 
275 2ftD 2B5 

Leu Pro Ii«u Trp Ala Xaa Xaa Val Ala Xaa Phe Ser Tyr Aan Typ Thr 
250 2^5 300 

Phe Tyx 7hr Xaa Lau Xaa Leu Ueu Pro Thr xaa Met Lyd Xaa xha Lau 
305 310 315 320 

arg phe Aan Xaa Qln Glu Aan Gly Pha Lau ^bt Xaa xaa Pro Tyr x^u 

2B 
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i3D 33S 

Xaa Xaa trp Leu Cya M*t Il6 Leu Xaa Cly Gin Ala Ala A«p Aan Leu 
340 345 350 

Arg Aia Xaa Trp Aan Ph« Ser Thr Xaa JCaa VaI Xaa Arg Xaa Ph« Bar 
355 360 i€5 

L*u tie xaa Met lie Qly Pro 2{«a xaa 7he L«u Xaa X4a Xaa Xaa 
370 375 350 

XAA caiy Cys Aep Tyr xaa lieu Xftft Val xaa Phe Leu xb» He fler Thx: 
3SS 3PS 400 

xait Lau Gly Qly Phe Cya ser Ser Qly Phe Ser lie Aan Hi* Lea 
405 «0 415 

He Al* ?ro Sftr ryr Ala Gly Xaa Leu Leu dly He Thr abzi xaa iPhe 
420 425 430 

Ala Thr He xaa Gly Met XAA Sly Pro xaa £l« Xaa Xm e«r Xaa Thr 
435 440 445 

Fro XAA A«n Thr X«A aiy Glu ^rp Qla Xaa 2£aa Fhe KA& Xaa AlA Ala 
450 4SS 4» 

Ala Xaa Ann xaa Phe Gly AlA II* She Xaa Thr Leu Phe AlA Lys Gly 
46S 470 475 490 

Cilu XAA Gin A»n Trp XA« Xaa Xaa AMP Hi0 Xaa Gly Hi J Ar? Xaa 
4«5 490 495 
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